TITLE

GENETIC MARKERS AND METHODS FOR THE DETECTION OF
LISTERIA MONOCYTOGENES AND LISTERIA SPP.
This is a continuation-in-part of Application No. 08/745,228, filed
8 November 1996.

FIELD OF INVENTION
The invention relates to the field of molecular biology and the use of
randomly amplified nucleic acid fragments (RAPD) for the selection of genetic
markers useful in the identification of bacteria. More specifically, the invention
relates to specific DNA marker sequences useful for the detection of Listeria
monocytogenes and Listeria spp. and the use of those diagnostic markers to determine
if an unknown bacterium is a member of either Listeria monocytogenes or Listeria
spp.

BACKGROUND 

Central to the field of microbiology is the ability to positively identify
microorganisms at the level of genus, species or serotype. Correct identification is
not only an essential tool in the laboratory, but it plays a significant role in the
control of microbial contamination in the processing of food stuffs, the production
of agricultural products, and the monitoring of environmental media such as
ground water. Increasing stringency in regulations that apply to microbial
contamination has resulted in a corresponding increase in industry resources
which must be dedicated to contamination monitoring.

Of greatest concern is the detection and control of pathogenic
microorganisms. Although a broad range of microorganisms have been classified
as pathogenic, attention has primarily focused on a few bacterial groupings such as
Escherichia, Salmonella, Listeria and Clostridia. Typically, pathogen
identification has relied on methods for distinguishing phenotypic aspects such as
growth or motility characteristics, and for immunological and serological
characteristics. Selective growth procedures and immunological methods are the
traditional methods of choice for bacterial identification and these can be effective
for the presumptive detection of a large number of species within a particular
genus. However, these methods are time consuming and are subject to error.
Selective growth methods require culturing and subculturing in selective media,
followed by subjective analysis by an experienced investigator. Immunological
detection (e.g., ELISA) is more rapid and specific; however, it still requires
growth of a significant population of organisms and isolation of the relevant
antigens. For these reasons interest has turned to detection of bacterial pathogens
on the basis of nucleic acid sequence.

It is well known, for example, that nucleic acid sequences associated with
the ribosomes of bacteria are often highly conserved across genera and are
therefore useful for identification (Webster, U.S. Patent No. 4,717,653 and U.S.
Patent No. 5,087,558; Enns, Lab. Med., 19, 295 (1988); Mordarski, Soc. Appl.
Bacteriol. Tech. Ser., 20 (Chem. Methods Bact. Syst.), 41 (1985)). Weisburg et
al. (EP 51736) disclose a method for the detection and identification of pathogenic
microorganisms involving the PCR amplification and labeling of a target nucleotide
for hybridization to 16S rDNA of E. coli. Lane et al. (WO 9015157) teach
universal nucleic acid probes that hybridize to conserved regions of 23S or 16S
rRNA of eubacteria.

Although bacterial ribosomal nucleic acids contain highly conserved
sequences, they are not the only sources of base sequence conservation that is
useful for microorganism identification. Wheatcroft et al. (CA 2055302) describe
the selection of transposable elements, flanked by unique DNA sequences, for the
detection of various Rhizobium strains. Similarly, Tommassen et al.
(WO 9011370) disclose polynucleotide probes and methods for the identification
and detection of gram-positive bacteria. The method of Tommassen et al. relies on
probes corresponding to relatively short fragments of the outer membrane protein,
OmpA, which is known to be highly conserved throughout gram-positive genera.
Atlas et al. (EP 517154) teach a nucleic acid hybridization method for the
detection of Giardia sp. based on designing probes with sequences complementary
to regions of the gene encoding the giardin protein. Webster et al. (U.S. Patent
No. 4,717,653) have expanded upon the use of rRNA in disclosing a method for the
characterization of bacteria based on the comparison of the chromatographic
pattern of restriction endonuclease-digested DNA from the unknown organism
with equivalent chromatographic patterns of at least 2 known different organism
species. The digested DNA has been hybridized or reassociated with ribosomal
RNA information-containing nucleic acid from (or derived from) a known probe
organism. The method of Webster et al. effectively establishes a unique bacterial
nucleic acid "fingerprint" corresponding to a particular bacterial genus against
which unknown "fingerprints" are compared.

Methods for the identification of Listeria monocytogenes using specific
hybridization probes or primers are known. For example, U.S. 5523205 and
JP 05219997 teach DNA probes capable of hybridizing to a portion of the genome
of pathogenic Listeria monocytogenes but not to genomes of other
Listeria species. DE 4238699 and EP 576842 teach methods for detection of
Listeria monocytogenes using primers designed to give amplification products
specific to the monocytogenes genome. EP 576842 discusses a method for the
detection of L. monocytogenes using amplification primers based on genes
encoding the highly conserved iap (invasion-associated protein) of Listeria, and
WO 9008841 teaches nucleic acid probes capable of hybridizing to ribosomal
RNA (rRNA) or rDNA of Listeria and not to rRNA or DNA of non-Listeria.
The methods described above are useful for the detection of bacteria, but
each relies upon knowledge of a gene, protein, or other specific sequence known
a priori to be highly conserved throughout a specific bacterial group. An
alternative method would involve a nontargeted analysis of bacterial genomic
DNA for specific non-phenotypic genetic markers common to all members of that
bacterial group. For example, genetic markers based on single point mutations may be
detected by differentiating DNA banding patterns from restriction enzyme analysis.
As restriction enzymes cut DNA at specific sequences, a point mutation within this
site results in the loss or gain of a recognition site, giving rise in that region to
restriction fragments of different length. Mutations caused by the insertion,
deletion or inversion of DNA stretches will also lead to a length variation of DNA
restriction fragments. Genomic restriction fragments of different lengths between
genotypes can be detected on Southern blots (Southern, J. Mol. Biol., 98, 503
(1975)). The genomic DNA is typically digested with any restriction enzyme of
choice, the fragments are electrophoretically separated, and then hybridized against
a suitably labeled probe for detection. The sequence variation detected by this
method is known as restriction fragment length polymorphism or RFLP (Botstein
et al., Am. J. Hum. Genet., 32, 314 (1980)). RFLP genetic markers are particularly useful
in detecting genetic variation in phenotypically silent mutations and serve as highly
accurate diagnostic tools.
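By way of illustration only, the effect of a point mutation on restriction
fragment lengths can be sketched in Python. The sequences, the helper name
digest, and the simplification that the cut falls at the start of the recognition
site are assumptions made for this example and are not taken from the cited
references.

```python
def digest(seq: str, site: str) -> list[int]:
    """Return fragment lengths after cutting seq at every occurrence of site.

    Simplifying assumption: the cut is placed at the first base of the
    recognition site; real enzymes cut at an enzyme-specific offset.
    """
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:]) if b > a]


ECORI = "GAATTC"  # EcoRI recognition sequence

# Hypothetical 45-base sequence carrying two recognition sites.
wild_type = "ATGC" * 3 + ECORI + "TTAACCGG" * 2 + ECORI + "CCGGA"
# Hypothetical point mutation (G to C) that destroys the first site.
mutant = wild_type.replace(ECORI, "CAATTC", 1)

wt_fragments = digest(wild_type, ECORI)   # three fragments
mut_fragments = digest(mutant, ECORI)     # two fragments, one of them longer
```

The loss of a single site merges two adjacent fragments into one longer
fragment, which is the length difference that a Southern blot would reveal.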
Another method of identifying genetic polymorphic markers employs DNA
amplification using short primers of arbitrary sequence. These primers have been
termed "random amplified polymorphic DNA" or "RAPD" primers (see Williams
et al., Nucl. Acids Res., 18, 6531 (1990) and U.S. Patent No. 5,126,239; also
EP 0 543 484 A2, WO 92/07095, WO 92/07948, WO 92/14844, and
WO 92/03! 67). The RAPD method amplifies either double or single-stranded,
nontargeted, arbitrary DNA sequences using standard amplification buffers, dATP,
dCTP, dGTP and TTP, and a thermostable DNA polymerase such as Taq. The
nucleotide sequence of the primers is typically about 9 to 13 bases in length,
between 50 and 80% G + C in composition and contains no palindromic
sequences. RAPD detection of genetic polymorphisms represents an advance over
RFLP in that it is less time consuming, more informative, and readily susceptible to
automation. Because of its sensitivity for the detection of polymorphisms, RAPD
analysis and variations based on RAPD/PCR methods have become the methods of
choice for analyzing genetic variation within species or closely related genera, both
in the animal and plant kingdoms. For example, Landry et al. (Genome, 36, 580
(1993)) discuss the use of RAPD analysis to distinguish various species of minute
parasitic wasps that are not morphologically distinct. Van Belkum et al. (Mol.
Biochem. Parasitol., 61, 69 (1993)) teach the use of PCR-RAPD for the
distinction of various species of Giardia.
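The primer characteristics recited above (about 9 to 13 bases, 50 to 80% G + C,
no palindromic sequences) lend themselves to a simple screening routine. The
following Python sketch is illustrative only: the function names, the example
sequences, and the choice of a 6-base window for the palindrome test are
assumptions, since the text does not define a minimum palindrome length.

```python
def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_palindrome(seq: str, window: int = 6) -> bool:
    """True if any window of the given size equals its own reverse complement.

    The 6-base window is an assumed cut-off, not a value from the text.
    """
    for i in range(len(seq) - window + 1):
        piece = seq[i:i + window]
        if piece == reverse_complement(piece):
            return True
    return False


def is_candidate_rapd_primer(seq: str) -> bool:
    """Apply the length, G + C content and palindrome criteria."""
    seq = seq.upper()
    return (
        9 <= len(seq) <= 13
        and 0.50 <= gc_fraction(seq) <= 0.80
        and not has_palindrome(seq)
    )


# Hypothetical sequences, not primers from the sequence listing.
accepted = is_candidate_rapd_primer("GGCATCGGCTCA")   # 12-mer, 67% G + C
rejected = is_candidate_rapd_primer("GGAATTCCGA")     # contains GAATTC
```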

In commonly assigned U.S. Patent No. 5,340,728, Applicants disclosed a
method of double-nested PCR which is used to detect the presence of a specific
microbe. This disclosure first describes identifying a random, unique segment of
DNA for each individual microorganism which will be diagnostic for that
microorganism. To identify and obtain this diagnostic nucleic acid segment a
series of polymorphic markers is generated from each organism of interest using 
single primer RAPD analysis. The RAPD series from each organism is compared 
to similarly generated RAPD series for other organisms, and a RAPD marker 
unique to all members of the group is then selected. The unique marker is then 
isolated, amplified and sequenced. Outer primers and inner primers suitable for 
double-nested PCR of each marker may then be developed. These primers 
comprise sequence segments within the RAPD markers, wherein the inner set of 
primers will be complementary to the 3' ends of the target piece of nucleic acid. 
These nested primers may then be used for nested PCR amplification to definitely 
detect the presence of a specific microorganism. 

In commonly owned PCT U.S. 95/06704 (WO 95/33854), Applicants 
more particularly adapted and described this RAPD methodology to identify a 
sequence or marker. The presence of the marker is diagnostic for all individuals of 
the genus Salmonella. PCT U.S. 95/06704 teaches a method involving a RAPD 
amplification of genomic DNA of a representative number of Salmonella 
individuals to produce a RAPD amplification product, termed the diagnostic 
fragment. This diagnostic fragment must be present in the RAPD profiles in over 
90% of the individuals tested. Sequence information from the diagnostic fragment 
enables identification of the most suitable PCR primer binding sites within the 
diagnostic fragment to define a unique diagnostic marker. Primers flanking this 
marker are useful for the generation of amplification products from Salmonella 
genomic DNA, but will not produce any amplification products in non-Salmonella 
genera. 

In commonly owned USSN 08/608,881, Applicants disclose a method,
diagnostic sequences and primers that are useful in the identification of the
Escherichia coli O157:H7 serotype. The method involves the identification of a
RAPD-amplified DNA fragment common to O157:H7 Escherichia coli, the
identification of the most conserved regions of that fragment, and the preparation
of specific primers useful for detecting the presence of a marker within the
fragment whereby that set of primers is then useful in the identification of all
O157:H7 Escherichia coli. The method of 08/608,881 does not teach markers
useful for the specific identification of Listeria monocytogenes and Listeria spp.

A detection methodology using PCR/RAPD specific to Listeria
monocytogenes and Listeria spp. would be of high utility in the food industry.
Detection methods not dependent on sequences derived from a known gene or
associated with a known phenotypic characteristic of Listeria monocytogenes and
Listeria spp. have not previously been disclosed.

SUMMARY OF THE INVENTION 
The present invention provides a method for the specific identification of 
Listeria monocytogenes and Listeria spp. using diagnostic genetic markers. 

A method is provided for determining whether an unknown bacterium is a
Listeria monocytogenes that involves:

(A) amplifying genomic DNA from (i) a positive test panel of Listeria
monocytogenes strains and (ii) a negative test panel of non-monocytogenes
Listeria strains with a primer derived from a pre-marker diagnostic fragment for
Listeria monocytogenes selected from the group of nucleic acids corresponding to
SEQ ID NOS:17, 18, and 19 to yield a 1300 bp diagnostic fragment for each of
the positive and negative test panels;

(B) selecting at least one Listeria monocytogenes diagnostic marker
contained within the diagnostic fragment by comparing the diagnostic fragment
obtained from the amplification of the positive test panel with the diagnostic
fragment obtained from the amplification of the negative test panel whereby at
least one highly conserved region in the diagnostic fragment of the positive test
panel is identified which is less than 90% homologous to any member of the
negative test panel;

(C) designing at least one amplification primer corresponding to the at
least one diagnostic marker identified in step (B); and

(D) amplifying genomic DNA of the unknown bacterium under
suitable annealing temperatures with the at least one amplification primer of step
(C), whereby obtaining at least one amplification product indicates that the
unknown bacterium is a Listeria monocytogenes.

The method preferably uses Listeria monocytogenes pre-marker diagnostic
fragments selected from the group consisting of nucleic acids corresponding to
SEQ ID NOS:20-23. The method preferably uses Listeria monocytogenes
diagnostic fragments that are at least 83% homologous to SEQ ID NOS:24-31 and
33-40. The method preferably uses diagnostic fragments selected from the group
consisting of nucleic acids corresponding to SEQ ID NOS:24-31 and 33-40. The
method preferably uses at least one diagnostic marker selected in step (B)
from the group consisting of nucleic acids corresponding to SEQ ID NOS:46-83.
Preferably, the amplification primers used are about 15 to 30 bp in length
and suitable annealing temperatures are in the range of about 60 °C to 70 °C.

A method is also provided for determining whether an unknown bacterium
is a member of the genus Listeria, comprising:

(A) amplifying genomic DNA from (i) a positive test panel of Listeria
monocytogenes strains and (ii) a negative test panel of non-monocytogenes
Listeria strains with a primer derived from a pre-marker diagnostic fragment for
Listeria monocytogenes strains selected from the group consisting of nucleic acids
corresponding to SEQ ID NOS:17, 18 and 19 to yield a 1300 bp diagnostic
fragment for each of the positive and negative test panels;

(B) selecting at least one Listeria genus-specific diagnostic marker
contained within the diagnostic fragment by comparing the diagnostic fragment
obtained from the amplification of the positive test panel with the diagnostic
fragment obtained from the amplification of the negative test panel whereby at
least one highly conserved region in the diagnostic fragment of the positive test
panel is identified which is at least 90% homologous to the corresponding positive
test panel of diagnostic fragment;

(C) designing amplification primers corresponding to the at least one
Listeria genus-specific diagnostic marker selected in step (B); and

(D) amplifying genomic DNA of the unknown bacterium under
suitable annealing temperatures with the amplification primers of step (C),
whereby obtaining amplification products indicates that the unknown bacterium is
a member of the genus Listeria.

The genus-specific method at step (A) preferably uses a diagnostic
fragment at least 83% homologous to any one of SEQ ID NOS:24-31 and 33-40. The
method at step (A) preferably uses a diagnostic fragment selected from the group
consisting of nucleic acids corresponding to SEQ ID NOS:24-31 and 33-40. The
method preferably uses Listeria monocytogenes pre-marker diagnostic fragments
selected from the group consisting of nucleic acids corresponding to SEQ ID
NOS:20-23. The method preferably uses diagnostic markers selected in step (B)
from the group consisting of nucleic acids corresponding to SEQ ID NOS:84-110.
Preferably, the method uses amplification primers of about 15 to 30 bp in
length and uses a suitable annealing temperature in the range of about 60 °C to
70 °C.
A hybridization method for determining whether an unknown bacterium is
a Listeria monocytogenes is provided comprising contacting the genomic DNA of
the unknown bacterium with a nucleic acid probe selected from the group
consisting of nucleic acid sequences corresponding to SEQ ID NOS:46-83, and
then detecting hybridization of the nucleic acid probe with the genomic DNA. A
genus-specific hybridization method for determining whether an unknown
bacterium is a Listeria monocytogenes is provided comprising contacting the
genomic DNA of the unknown bacterium with a nucleic acid probe selected from
the group consisting of nucleic acid sequences corresponding to SEQ ID
NOS:84-110, and then detecting hybridization of the nucleic acid probe with the
genomic DNA.

Isolated nucleic acid fragments are provided selected from the group
consisting of nucleic acid fragments corresponding to SEQ ID NOS:17 through
110. Isolated nucleic acid fragments are provided encoding the amino acid
sequence as given in any one of SEQ ID NOS:32 and 41-45. This invention
further provides isolated nucleic acid fragments having SEQ ID NOS:17-110.

A further embodiment of the invention are nucleic acid fragments located
on a diagnostic fragment of about 1300 bp and selected from the group consisting
of nucleic acid fragments designated



1515(rc341x2)-26-363,
1515(rc341x2)-27-281,
1515-26-36,
1515-27-357,
1515-26-rc233,
1515(8585)-27-rc737,
1515(8585)-28-rc793,
1515-30-76,
1515-30-88,
1515(8585)-30-624,
1515(8585)-30-rc483,
1515(8585)-30-rc555,
1515(8585)-30-rc573,
1515(8585)-30-rc824,

the diagnostic fragment characterized by

(A) at least 83% homology to any one of SEQ ID NOS:24-31 and 33-40; and

(B) an open reading frame of about 855 bp contained within the
diagnostic fragment, the open reading frame encoding an amino acid sequence of
any one of SEQ ID NOS:32 and 41-45.

In greater detail, the methods involve the following steps:

(i) RAPD analysis: The genomic DNA of positive and negative test
panels of a representative number of individuals for Listeria monocytogenes was
amplified using RAPD primers. The positive test panel consisted of 20 strains of
Listeria monocytogenes and the negative test panel consisted of 25 strains of
non-monocytogenes Listeria spp. RAPD amplification gave some amplification
products specific to the positive test panel that were not seen in the negative test
panel.

The RAPD marker profiles from individuals of the positive test panel were
compared with the RAPD marker profiles from individuals of the negative test
panel and a nucleic acid fragment was selected where the fragment was present in
all of the RAPD marker profiles from the positive test panel and absent in the
RAPD marker profiles from the negative test panel. This fragment was termed a
"pre-marker sequence".
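The selection rule of step (i), a fragment present in every positive-panel
profile and absent from every negative-panel profile, can be expressed as a set
operation. The Python sketch below is illustrative only: each profile is modeled
as a set of band sizes in bp, and the strain labels and band sizes are invented
for the example rather than taken from the test panels.

```python
def candidate_markers(positive: dict, negative: dict) -> set:
    """Bands found in every positive profile and in no negative profile."""
    shared_by_all_positive = set.intersection(*positive.values())
    seen_in_any_negative = set().union(*negative.values())
    return shared_by_all_positive - seen_in_any_negative


# Hypothetical RAPD profiles: strain label -> set of band sizes (bp).
positive_panel = {
    "Lm_strain_1": {300, 500, 900},
    "Lm_strain_2": {500, 650, 900},
    "Lm_strain_3": {500, 900, 1200},
}
negative_panel = {
    "non_Lm_strain_1": {300, 900},
    "non_Lm_strain_2": {650, 1100},
}

markers = candidate_markers(positive_panel, negative_panel)
```

In this invented data set the 900 bp band is shared by all positive strains but
also appears in a negative strain, so only the 500 bp band survives the
selection. Real gel data would additionally require a tolerance when matching
band sizes, which this sketch omits.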

(ii) Sequencing: The nucleotides of the pre-marker sequence of step
(i) were sequenced to identify available primer binding sites.

(iii) Evaluation of the pre-marker sequence for Listeria
monocytogenes specificity: Single primers derived from the pre-marker sequence
were selected. These primers produced single amplification products when used to
amplify genomic Listeria monocytogenes DNA.

(iv) Determination and isolation of the diagnostic fragment:
Sequences of the flanking regions of the pre-marker sequence were determined,
revealing a diagnostic fragment of 1300 bp. Sequencing of the diagnostic
fragment in Listeria monocytogenes and non-monocytogenes Listeria revealed
conserved regions specific both to Listeria spp. in general and Listeria
monocytogenes in particular. Amplification primers were designed based on these
conserved regions.

(v) Preliminary selection of Listeria monocytogenes diagnostic
primers on the basis of sensitivity to annealing temperature: Primers unique to
Listeria monocytogenes were identified based on the diagnostic fragment. Primer
pairs were selected on the basis of their ability to resist the formation of
non-specific amplification products as annealing temperatures were reduced.

(vi) Final Selection of Listeria monocytogenes diagnostic primers:
The primers of step (v) were used in the amplification of genomic DNA from a
large group of Listeria monocytogenes (positive test panel) and
non-monocytogenes species (negative test panel) under specific annealing conditions,
confirming the specificity of these primers for Listeria monocytogenes detection.

(vii) Preliminary Selectivity Testing for Listeria spp. diagnostic
primers: Primers unique to Listeria spp. were identified based on the diagnostic
fragment. Primer pairs were selected on the basis of their ability to specifically
detect Listeria spp.

(viii) Selection of Listeria spp. diagnostic primers on the basis of
sensitivity to annealing temperature: The primer pairs of step (vii) were screened
on the basis of their ability to resist the formation of non-specific amplification
products as the annealing temperatures were reduced.

(ix) Final Selection of Listeria spp. diagnostic primers: The primers of
step (viii) were used in the amplification of genomic DNA from a large group of
Listeria spp. (positive test panel) and non-Listeria spp. (negative test panel) under
specific annealing conditions, confirming the specificity of these primers for
Listeria spp. detection.

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 is a gel showing RAPD patterns for Listeria monocytogenes 
strains comprising both the negative and positive test panels, amplified with the 
12-mer primer 12CN015. The specific lanes are identified in Table 3. 
Figure 2 shows a comparison of the amino acid sequences of
L. monocytogenes #647, L. innocua DP #4450, L. seeligeri DP #3327,
L. welshimeri DP #3359, and L. ivanovii DP #3340.

Figure 3 shows the unique Listeria monocytogenes specific diagnostic
primer sequences located at 1515(rc341x2)-26-363, 1515(rc341x2)-27-281,
1515-26-36, 1515-27-357, 1515-26-rc233, 1515(8585)-27-rc737, and
1515(8585)-28-rc793 and a comparison of priming site sequences for strains
representing the following species: L. monocytogenes, L. innocua, L. seeligeri,
L. welshimeri and L. ivanovii.

Figures 4A-4C are gels illustrating the appearance of anomalous and false
positive amplification products as the annealing temperature was reduced for the
1515-26-36/1515-26-rc233 primer pair. The specific lanes are identified in
Table 7.

Figure 5A displays the PCR product patterns of Listeria monocytogenes
strains from the positive test panel amplified with the primer pair
1515-27-357/1515(8585)-27-rc737. The specific lanes are identified in Table 8,
column A.

Figure 5B displays the PCR product patterns of Listeria monocytogenes
strains from the negative test panel amplified with the primer pair
1515-27-357/1515(8585)-27-rc737. The specific lanes are identified in Table 8,
column B.

Figure 6 is a gel showing the seven Listeria spp. specific primer sequences
located at 1515-30-76, 1515-30-88, 1515(8585)-30-624, 1515(8585)-30-rc483,
1515(8585)-30-rc555, 1515(8585)-30-rc573, and 1515(8585)-30-rc824 and a
comparison of priming site sequences for strains representing the following
species: L. monocytogenes, L. innocua, L. seeligeri, L. welshimeri and
L. ivanovii.

Figure 7 is a gel showing Listeria spp. positive test panel response for PCR
products generated from primer set 1515-30-76/1515(8585)-30-rc555. The
specific lanes are identified in Table 11.

Figure 8 is a gel showing Listeria spp. negative test panel response for 
PCR products generated from primer set 1515-30-76/1515(8585)-30-rc555. The
specific lanes are identified in Table 12. 

Applicants have provided 110 sequence listings in conformity with
37 C.F.R. 1.821-1.825 and Appendices A and B ("Requirements for Application
Disclosure Containing Nucleotides and/or Amino Acid Sequences") and in
conformity with "Rules for the Standard Representation of Nucleotide and Amino
Acid Sequences in Patent Applications" and Annexes I and II to the Decision of
the President of the EPO, published in Supplement No. 2 to the OJ EPO, 12/1992.

Sequences of SEQ ID NOS:1-16 are twelve-base arbitrary primers used in
the generation of RAPD patterns. These are also shown in Table 1. Sequences of
SEQ ID NOS:17-19 are single primers derived from the pre-marker sequences.
These are also shown in Table 2. SEQ ID NOS:20 and 21 represent the
pre-marker sequence for strain #647 and SEQ ID NOS:22 and 23 represent the
pre-marker sequence for strain #1324. The amino acid composition for all L.
monocytogenes strains is represented in SEQ ID NO:32. Sequences
corresponding to SEQ ID NOS:24-31 and 33-40 are diagnostic fragments.
Sequences corresponding to SEQ ID NOS:32 and 41-45 are open reading frames
encoding amino acid sequences.

DETAILED DESCRIPTION OF THE INVENTION 
In the present method, Applicants have used RAPD amplification of
Listeria monocytogenes and Listeria spp. genomic DNA to discover diagnostic
fragments and primers useful for the specific detection of Listeria monocytogenes
and Listeria spp. The fragments are used to generate specific primers from the
most conserved regions for use in a PCR assay that will produce amplification
products specific to either Listeria monocytogenes or Listeria spp.
Applicant's method is distinctive in the following regard. To selectively
detect Listeria monocytogenes from all other Listeria or Listeria spp. from all
other bacteria, the method must be successful in determining the most conserved
regions of the diagnostic fragments from a phenotypically uncharacterized segment
of DNA common to all Listeria monocytogenes or all Listeria spp. One of skill in
the art will recognize that conservation of sequence may be both an ally and an
enemy in identifying the members of a particular genus. For example, many
bacterial sequences are conserved across genera and these would not be useful in
the determination of species within a particular genus. It is precisely for that
reason that methods previously known in the art rely primarily on the analysis of
sequences derived from proteins or genes known to be specific to a particular
genus, e.g., ribosomal RNA or toxin-encoding genes. Applicant's method departs
from the art in that the conserved sequences of the invention are not derived from
a known gene nor is the sequence associated with any known phenotypic
characteristic.

As used herein the following terms may be used for interpretation of the
claims and specification.

"Nucleic acid" refers to a molecule which can be single-stranded or
double-stranded, comprising monomers (nucleotides) containing a sugar, phosphate and
either a purine or pyrimidine. In bacteria, lower eukaryotes, and in higher animals
and plants, "deoxyribonucleic acid" (DNA) refers to the genetic material while
"ribonucleic acid" (RNA) is involved in the translation of the information from
DNA into proteins.

The term "primer-directed amplification" refers to any of a number of
methods known in the art that result in logarithmic amplification of nucleic acid
molecules using the recognition of a specific nucleic acid sequence or sequences to
initiate an amplification process. Applicants contemplate that amplification may be
accomplished by any of several schemes known in this art, including but not
limited to the polymerase chain reaction (PCR) or ligase chain reaction (LCR). If
PCR methodology is selected, the amplification method would include a
replication composition consisting of, for example, nucleotide triphosphates, two
primers with appropriate sequences, DNA or RNA polymerase and proteins.
These reagents and details describing procedures for their use in amplifying nucleic
acids are provided in U.S. Patent No. 4,683,202 (1987, Mullis et al.) and U.S.
Patent No. 4,683,195 (1986, Mullis et al.).

The term "pre-marker sequence" refers to a 414 bp fragment of DNA that 
is an internal region of the diagnostic fragment. 
The term "derived from", with reference to an amplification primer, refers
to the fact that the sequence of the primer is a fragment of the sequence from
which it has been "derived". The fragment is always denoted in a 5' to 3'
orientation. The useful primer sequence size range for PCR amplification is about
15 base pairs to about 30 base pairs in length.

A "diagnostic fragment" refers to a particular DNA sequence which is
highly conserved amongst the individuals of a particular genetically related
population, for example, a genus, species, or serotype of bacteria. In the instant
invention, the term "diagnostic fragment" is used to refer to the composite of that
DNA fragment generated during RAPD amplification and those fragments that are
generated from amplification with single primers derived from the pre-marker
sequence, which are present in the RAPD and the single primer amplification
profiles from either 1) all Listeria monocytogenes and absent from other Listeria
spp., or 2) all Listeria spp. and absent from profiles from non-Listeria
species. The term "diagnostic marker" is used herein to refer to that portion of the
diagnostic fragment which can be targeted to produce an amplification product
only in either Listeria monocytogenes or Listeria spp. The diagnostic marker is
present only in the organism to be identified at the desired classification level (i.e.,
species or genus) and attempts to amplify the diagnostic markers in individuals not
so targeted will give no amplification product. Within the context of the present
invention diagnostic fragments which are diagnostic markers for Listeria
monocytogenes and Listeria spp. and useful in Applicant's invention include
nucleic acid sequences SEQ ID NOS:24-31 and 33-40 of about 1300 bp
containing an open reading frame of 855 bp, encoding the peptide as given in SEQ
ID NO:32.

The terms "conserved" or "highly conserved" refer to a level of similarity
that exists between 2 or more nucleic acid fragments where there is at least 90%
base similarity between the fragments. The term "base similarity" refers to the
relatedness between the nucleotide sequences of two nucleic acid molecules.
Estimates of such similarity are provided by either DNA-DNA or DNA-RNA
hybridization under conditions of stringency as is well understood by those skilled
in the art (Sambrook et al., Molecular Cloning: A Laboratory Manual, volumes 1,
2, 3 (Cold Spring Harbor Laboratory, Cold Spring Harbor, New York)).
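
Where the sequences of two fragments are already known, the 90% base similarity threshold can also be illustrated by direct comparison. The following is a minimal sketch only: it assumes the two sequences have already been aligned to equal length, and the function names and toy sequences are hypothetical. It is not the hybridization-based estimate described above.

```python
def base_similarity(seq_a: str, seq_b: str) -> float:
    """Percent of positions at which two pre-aligned, equal-length sequences match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


def is_conserved(seq_a: str, seq_b: str, threshold: float = 90.0) -> bool:
    """Apply the 'at least 90% base similarity' criterion to two aligned sequences."""
    return base_similarity(seq_a, seq_b) >= threshold


# Hypothetical 20-base toy fragments differing at a single position.
reference = "GGACAGAGCATAGGACAGAG"
variant = "GGACAGAGCATAGGACAGAC"
print(base_similarity(reference, variant), is_conserved(reference, variant))
```
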

The term "primer" refers to a nucleic acid fragment or sequence that is
complementary to at least one section along a strand of the sample nucleic acid,
wherein the purpose of the primer is to sponsor and direct nucleic acid replication
of a portion of the sample nucleic acid along that strand. Primers can be designed
to be complementary to specific segments of a targeted sequence. In PCR, for
example, each primer is used in combination with another primer forming a
"primer set" or "primer pair"; this pair flanks the targeted sequence to be amplified.
In RAPD amplification, single arbitrary primers are used to amplify nontargeted
segments of nucleic acid which are located between the primer sequence sites in
opposing DNA strands. The term "primer", as such, is used generally by Applicant
to encompass any sequence-binding oligonucleotide which functions to initiate the
nucleic acid replication process. "Diagnostic primers" will refer to primers
designed with sequences complementary to primer binding sites on the diagnostic
marker. Diagnostic primers are useful in the convenient detection and
identification of diagnostic markers specific to Listeria monocytogenes and
Listeria spp.

A "genetically related population" refers to any grouping of microorganisms
possessing multiple or single genotypic or phenotypic characteristics of
sufficient similarity to allow said organisms to be classified as a single genus,
species, or subspecies of bacteria. For purposes of the present disclosure,
examples of genetically related populations include Listeria
monocytogenes and Listeria spp.

A "test panel" refers to a particular group of organisms or individuals
selected on the basis of their genetic similarity to each other or on the basis of their
genetic dissimilarity to another group (i.e., another genus, species, subspecies or
serotype). A "positive test panel" will refer to a number of individuals selected for
the desired genetic similarity between those individuals and, in the instant case, will
be comprised of individuals of either Listeria monocytogenes or Listeria spp.

Similarly, a "negative test panel" will refer to a test panel selected on the
basis of genetic diversity between its members and the members of the positive test
panel. A suitable negative test panel in the present invention would be comprised
of non-Listeria monocytogenes where L. monocytogenes is the target organism or
non-Listeria spp. where Listeria spp. is the target organism.

The term "unknown microorganism" or "unknown bacterium" refers to a
microorganism or bacterium whose identity is undetermined.

The term "amplification product" refers to specific DNA fragments
generated from any primer-directed nucleic acid amplification reaction. The
diagnostic markers of the present invention are amplification products generated in
a PCR reaction using diagnostic primers and are useful for the detection of Listeria
monocytogenes and Listeria spp.

The term "RAPD" refers to "random amplified polymorphic DNA".
"RAPD amplification" refers to a method of single primer-directed amplification of
nucleic acids using short primers of arbitrary sequence to amplify nontargeted,
random segments of nucleic acid. The method is disclosed and claimed in U.S.
Patent No. 5,126,239. "RAPD method" or "RAPD analysis" refers to a method
for the detection of genetic polymorphisms involving the nontargeted amplification
of nucleic acids using short primers of arbitrary sequence, whereby the profile or
pattern of "RAPD" amplification products is compared between samples to detect
polymorphisms. "RAPD primers" refers to primers of about 8 to 13 bp, of
arbitrary sequence, useful in the RAPD amplification or RAPD analysis according
to the instant method. The "RAPD marker profile" refers to the pattern, or
fingerprint, of amplified DNA fragments which are amplified during the RAPD
method and separated and visualized by gel electrophoresis.
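
The principle of single-primer amplification between sites on opposing strands can be illustrated computationally. The sketch below is an idealized model under stated assumptions: it requires exact matches over the full primer length (real RAPD priming tolerates mismatches), the maximum product size is an arbitrary illustrative cutoff, and the function names and toy template are hypothetical. Only the primer sequence 12CN15 is taken from this disclosure.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_all(text: str, pattern: str) -> list:
    """Start indices of every exact (possibly overlapping) occurrence of pattern."""
    hits = []
    start = text.find(pattern)
    while start != -1:
        hits.append(start)
        start = text.find(pattern, start + 1)
    return hits


def rapd_product_sizes(template: str, primer: str, max_size: int = 3000) -> list:
    """Sizes of products a single primer could yield on one template strand pair.

    A product needs the primer sequence on the given strand (extension to the
    right) and, downstream, its reverse complement (a priming site on the
    opposing strand, extension to the left).
    """
    template, primer = template.upper(), primer.upper()
    forward_sites = find_all(template, primer)
    reverse_sites = find_all(template, reverse_complement(primer))
    sizes = []
    for f in forward_sites:
        for r in reverse_sites:
            size = r + len(primer) - f  # product spans both priming sites
            if 2 * len(primer) <= size <= max_size:
                sizes.append(size)
    return sorted(sizes)


# Toy template: primer site, 20 bases of filler, then an inverted primer site.
primer_12cn15 = "GGACAGAGCATA"
toy_template = "TT" + primer_12cn15 + "ACGT" * 5 + reverse_complement(primer_12cn15) + "CC"
print(rapd_product_sizes(toy_template, primer_12cn15))
```

In this model the pattern of product sizes plays the role of the RAPD marker profile; a template lacking either priming site yields no product.
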

The term "ribotype" refers to a specific classification of bacteria or other
microorganisms based on the digestion of genomic DNA with a restriction
endonuclease, electrophoretic resolution of the restricted DNA and visualization of
those fragments containing rDNA sequences by means of hybridization with a
probe derived from the rDNA operon.

The diagnostic marker of the invention can be used to identify any member
of either Listeria monocytogenes to the exclusion of other Listeria spp., or
Listeria spp. to the exclusion of all other bacteria. In the present invention,
diagnostic primers flanking the marker are useful to amplify the marker using PCR.
Alternatively, nucleic acid probes could be developed based upon some or all of
the diagnostic marker sequences and thus used to detect the presence of the
marker sequence using standard solid phase or solution nucleic acid hybridization
and reporter methods. It is contemplated that regions of about 30 base pairs or
more of the diagnostic marker, especially encompassing the primer regions, could
be used as sites for hybridization of diagnostic probes. These methods might be
used specifically for the detection of Listeria monocytogenes or Listeria spp. in
food, human or animal body fluids or tissues, environmental media or medical
products and apparatus.

The instant method is more particularly described below with reference to
the specific method steps as provided in the Summary of the Invention.

Selection of RAPD Primers and Detection of Diagnostic Fragment in Members of
the Positive and Negative Test Panels, step (i):

Genomic DNA isolated from positive and negative test panels of
microorganisms was subjected to RAPD amplification using sixteen 12-base
primers of arbitrary sequence. The positive test panel consisted of 20 strains of
Listeria monocytogenes and is described in detail in the GENERAL METHODS
section below. The negative test panel consisted of 25 strains of various Listeria
spp. other than L. monocytogenes and is also described in the GENERAL METHODS
section below. Techniques for the isolation of genomic DNA are common and well
known in the art and examples may be found in Sambrook et al., Molecular
Cloning: A Laboratory Manual, volumes 1, 2, 3 (Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, New York (1989)).

RAPD primers of 12 bases in length were used because at this primer
length the RAPD patterns generally contained one to five amplified DNA
fragments. Use of shorter primers frequently resulted in a large number of
amplification products, which made the extraction of a single homogeneous
fragment for sequencing much more difficult. When primers of greater than
12 bases were used, a significant fraction of the bacterial strains produced no
RAPD products, which would have necessitated the screening of a much larger
number of arbitrary primers. One of the primers, designated 12CN15 (Table 1,
GENERAL METHODS), was found to produce a 414 bp amplification product
(termed a pre-marker sequence) in all strains of the positive test panel. 12CN15
had the sequence GGA CAG AGC ATA (SEQ ID NO:15).
This 414 bp pre-marker sequence was not observed in the amplification products
of the negative test panel using the same primer. Examples of the 12CN15 RAPD
patterns for strains of both test panels are shown in Figure 1.
Sequencing of Pre-marker Sequence, step (ii):

Since the 414 bp product was unique to L. monocytogenes, samples of this
product were isolated from two different strains of L. monocytogenes, DP #647 and
DP #1324, and the respective products sequenced. The two strains represented a
ribotype that was highly polymorphic (#1324) based on RAPD patterns and a
ribotype that is pathogenic (#647). The object of selecting a common pathogenic
strain and a polymorphic strain of L. monocytogenes was to characterize the
genetic diversity likely to be found within the 12CN15 marker fragment.

The complete sequences of the 414 bp products for DP #647 and #1324,
including the flanking 12CN15 sequences, are shown in SEQ ID NOS:20 and 21
for DP #647 and SEQ ID NOS:22 and 23 for DP #1324. Comparison of the
DP #647 and #1324 sequences shows 98% homology. Both sequences appear
to be an internal section of an open reading frame (ORF) with the same amino acid
composition.

Evaluation of Pre-marker Sequence for Listeria monocytogenes Specificity, step (iii):
The purpose of the initial PCR screening was to identify sequence domains
that demonstrated species selectivity. Primers based on the 414 bp pre-marker
sequences were first evaluated for their ability to specifically amplify from
L. monocytogenes genomic DNA. Initial primer sequences were 26 bases long
with a GC composition of 50 ± 5% to allow for an annealing temperature in the
range of 70 °C. Priming sites were selected within a distance of 200 bases from
each 12CN15 priming site. These sequences were examined to ensure that inter-
and intra-primer interactions were minimized to avoid the production of
nonspecific PCR artifacts.
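
The length and GC composition criteria for the initial primers (26 bases, GC composition of 50 ± 5%, as stated in Example 2) can be expressed as a simple screen. This is a minimal sketch; the function names and the toy sequences are hypothetical, and the screen does not address annealing temperature or inter- and intra-primer interactions.

```python
def gc_percent(primer: str) -> float:
    """Percentage of G and C bases in a primer sequence."""
    primer = primer.upper().replace(" ", "")
    if not primer:
        raise ValueError("primer must not be empty")
    gc = primer.count("G") + primer.count("C")
    return 100.0 * gc / len(primer)


def meets_initial_criteria(primer: str, length: int = 26,
                           gc_target: float = 50.0,
                           gc_tolerance: float = 5.0) -> bool:
    """True if the primer has the required length and a GC content within tolerance."""
    primer = primer.replace(" ", "")
    return (len(primer) == length
            and abs(gc_percent(primer) - gc_target) <= gc_tolerance)


# Hypothetical 26-base candidate (12 of 26 bases are G or C).
candidate = "ATGCATGCATGCATGCATGCATGCAT"
print(round(gc_percent(candidate), 1), meets_initial_criteria(candidate))
```
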

Many of the initial primer pairs that were tested generated multiple
amplification products from genomic L. monocytogenes DNAs. Multiple PCR
products can occur when at least 8 bases of one 3' primer sequence appear as an
inverted repeat in the region adjacent to the originally targeted priming sites. To
determine which products were the result of inverted repeats of a single primer
sequence and which primers were responsible for these products, amplification
reactions were run using single primers. A significant number of single primers
such as 1515-26-85 (SEQ ID NO:17), 1515-26-rc292 (SEQ ID NO:18) and
1515-26-rc341 (SEQ ID NO:19) (Table 2) were capable of generating PCR
products in single primer amplifications.
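
The inverted-repeat condition described above lends itself to a sequence search when the adjacent region is known. The sketch below assumes exact matching of the last 8 bases of the primer; the function names, the toy primer and the toy region are hypothetical and are not sequences of this disclosure.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def three_prime_inverted_repeat_sites(primer: str, region: str, k: int = 8) -> list:
    """Positions in `region` where the primer's 3' end could prime in reverse.

    `region` is read 5'->3' on the strand carrying the primer sequence. The
    primer's last k bases can anneal in the opposite orientation wherever that
    strand contains the reverse complement of those k bases.
    """
    target = reverse_complement(primer[-k:])
    region = region.upper()
    sites = []
    start = region.find(target)
    while start != -1:
        sites.append(start)
        start = region.find(target, start + 1)
    return sites


# Hypothetical primer and adjacent region containing one inverted 3' repeat.
toy_primer = "ACACACACACACACACACGGATCGAT"
toy_region = "TTTT" + reverse_complement(toy_primer[-8:]) + "GGGG"
print(three_prime_inverted_repeat_sites(toy_primer, toy_region))
```

A non-empty result flags a primer as a candidate for generating products in single primer reactions; confirmation still requires the amplification test described above.
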

Determination and Isolation of the Diagnostic Fragment, step (iv):
Determination of sequences adjacent to the RAPD marker
Each of the single primers, 1515-26-85 (SEQ ID NO:17), 1515-26-rc292
(SEQ ID NO:18) and 1515-26-rc341 (SEQ ID NO:19), generated a PCR product
that contained part of the original 414 bp fragment plus additional sequence.
Sequence determination of these augmented CN15 fragments was accomplished by
means well known in the art.

Sequencing revealed approximately 900 bases of additional sequence. The
new sequence consisted of 400 bases in the region preceding the original 12CN15
site and 500 bases in the region following the second 12CN15 site, revealing a
complete open reading frame (ORF).

The degree of conservation within this genetic locus was further examined by
amplification of additional Listeria monocytogenes strains and comparison of those
amplification products with the strains already sequenced.
Comparison revealed that all strains contain a 1300 bp diagnostic fragment
containing a complete open reading frame of 855 bases. This sequence was used
as the diagnostic fragment and is identified by SEQ ID NOS:24-31. To determine
if this fragment contained selective priming sites for Listeria monocytogenes
specific primers, further analysis of the sequence composition was done to
determine if it was distinct from other Listeria spp.

BASE SIMILARITY COMPARISON BETWEEN 1300 BP DIAGNOSTIC FRAGMENTS
A comparison of the base similarity between all isolated 1300 bp diagnostic
fragments is given below.

Sequencing marker fragment for other Listeria species - determination of
sequence composition of diagnostic fragment

Evaluation of primer sites in the 1300 bp diagnostic fragment showed that
not all primer sites were specific for Listeria monocytogenes, suggesting
conservation of the sequence composition at the genus level.

The genetic composition was explored by isolating and sequencing the
855 bp fragment and the flanking regions from a panel of non-monocytogenes
strains. All strains were found to contain the 855 base open reading frame.
However, a comparison of the sequences demonstrated there were species-dependent
differences in the nucleic acid composition within the ORF (SEQ ID
NOS:33-40 and Table 4, Example 3). The specific locations of these differences
suggest priming sites where the diversity of nucleotide sequence may be sufficient
to allow for selective priming of Listeria monocytogenes strains.

Preliminary selection of Listeria monocytogenes diagnostic primers on the basis of
sensitivity to annealing temperature, step (v):

A comparison of sequence data for the five Listeria spp. made it possible to
identify seven priming sites that were unique to L. monocytogenes. These priming
sites were as follows: 1515(rc341x2)-26-363, 1515(rc341x2)-27-281, 1515-26-36,
1515-27-357, 1515-26-rc233, 1515(8585)-27-rc737, and 1515(8585)-28-rc793
(Figure 3). Primers to these sites were 26 to 30 bases in length, depending on the
GC composition. Longer primers were used for sites with a GC composition of
< 50% to assure that a 70 °C annealing temperature could be maintained.
Figure 7 shows the primer sequences and a comparison of priming site sequences
for strains representing the following species: L. monocytogenes, L. innocua,
L. seeligeri, L. welshimeri and L. ivanovii.

To minimize the likelihood that L. monocytogenes selective primers would
generate PCR products from mismatched priming in non-monocytogenes strains,
the relationship between annealing temperature and selectivity was evaluated. A
test panel consisting of both Listeria monocytogenes and non-monocytogenes
strains was prepared and amplifications were carried out over a range of
temperatures and analyzed for the appearance of amplification products from
Listeria spp. that were non-monocytogenes. Primer pairs that showed an onset of
non-specific amplification at or above 65 °C were considered unsuitable for use
as Listeria monocytogenes diagnostic primers.

Final selection of Listeria monocytogenes diagnostic primers, step (vi):

Before making the final primer selection, the accuracy of all candidate
primer sets was evaluated for a larger group of monocytogenes strains. A set of
323 strains of L. monocytogenes was used to evaluate the inclusivity of the
following primer sets: 1) 1515-27-357/1515(8585)-27-rc737;
2) 1515-27-357/1515(8585)-28-rc793; and 3) 1515(rc341x2)-26-(-363)/1515-26-rc233
(Figure 3). Accuracy in detection of Listeria monocytogenes ranged from
99.5% to 100% depending on the primers and amplification conditions used.
None of the three primer sets generated amplification products for the 30
non-monocytogenes strains in the annealing stringency test panel.
Preliminary selectivity testing of Listeria spp. diagnostic primers, step (vii):
A comparison of sequence data for the five above-mentioned Listeria
species made it possible to identify seven priming sites present in strains
representing these species that were at least 90% homologous to
L. monocytogenes. These priming sites were as follows: 1515-30-76,
1515-30-88, 1515(8585)-30-624, 1515(8585)-30-rc483, 1515(8585)-30-rc555,
1515(8585)-30-rc573, and 1515(8585)-30-rc824 (Figure 6). Primers to these sites
were made 30 bases in length to compensate for mismatches in sequence and to
assure that a 70 °C annealing temperature could be maintained.

Before evaluating stringency conditions, the accuracy of all candidate
primer sets was evaluated for a group of 33 L. spp. strains. These strains were
used to evaluate the inclusivity of the following primer sets:
1) 1515-30-76/1515(8585)-30-rc483; 2) 1515-30-76/1515(8585)-30-rc555;
3) 1515-30-76/1515(8585)-30-rc573; 4) 1515-30-76/1515(8585)-30-rc824;
5) 1515-30-88/1515(8585)-30-rc483; 6) 1515-30-88/1515(8585)-30-rc555;
7) 1515-30-88/1515(8585)-30-rc573; 8) 1515(8585)-30-624/1515(8585)-30-rc824.

The accuracy in the detection of L. spp. was 100% for the following
primer sets: 1515-30-76/1515(8585)-30-rc555; 1515-30-76/1515(8585)-30-rc573;
1515-30-88/1515(8585)-30-rc555; and 1515-30-88/1515(8585)-30-rc573.

Selection of Listeria spp. diagnostic primers on the basis of sensitivity to annealing
temperature, step (viii):

As with Listeria monocytogenes, the relationship between annealing
temperature and selectivity was evaluated to minimize the generation of
nonspecific amplification products. This test was performed for all the primer pairs
that scored 100% in the preliminary inclusivity evaluation. Amplifications were
carried out on a test panel of 7 strains of Listeria spp. and non-L. spp. at an
annealing temperature of 70 °C. Annealing temperature was then decreased in
5 °C increments until the onset of nonspecific amplification products. The
1515-30-76/1515(8585)-30-rc555 primer set was found to be the least likely to
generate nonspecific amplification products.
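
The stepwise stringency test described above (start at 70 °C, decrease in 5 °C increments until nonspecific products appear) can be outlined as a short loop. This is a sketch only: the callable standing in for the laboratory observation, the lower temperature bound, and the example observation are all hypothetical assumptions and do not represent data from this disclosure.

```python
from typing import Callable, Optional


def onset_of_nonspecific(is_nonspecific_at: Callable[[int], bool],
                         start_c: int = 70,
                         step_c: int = 5,
                         floor_c: int = 40) -> Optional[int]:
    """Highest tested annealing temperature (°C) at which nonspecific products appear.

    `is_nonspecific_at` stands in for the experimental observation at a given
    temperature. `floor_c` is an assumed lower bound for the scan. Returns None
    if no nonspecific amplification is seen down to the floor.
    """
    temperature = start_c
    while temperature >= floor_c:
        if is_nonspecific_at(temperature):
            return temperature
        temperature -= step_c
    return None


# Hypothetical observation: nonspecific products appear at 55 °C and below.
print(onset_of_nonspecific(lambda t: t <= 55))
```

Under this outline, the primer set with the lowest onset temperature is the one least likely to generate nonspecific products at the working annealing temperature.
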
Final selection of Listeria spp. diagnostic primers, step (ix):

Before confirming the selection of the 1515-30-76/1515(8585)-30-rc555
primer set, the accuracy of this primer set was evaluated for a group of 73 L. spp.
strains. Accuracy in detection of L. monocytogenes, L. innocua, L. welshimeri,
L. seeligeri, and L. ivanovii was 100%. This primer set did not generate
amplification products for the 65 non-Listeria spp. strains in the non-Listeria test panel.

To determine whether the selected primer set would generate PCR products
from mismatched priming in non-monocytogenes strains, the relationship between
annealing temperature and selectivity was evaluated. A test panel consisting of
both Listeria spp. and non-L. spp. strains was prepared and amplifications were
carried out over a range of temperatures and analyzed for the appearance of
amplification products from non-L. spp. In this fashion, it was determined that
60 °C is the lowest annealing temperature that will result in specificity for L. spp.,
and 70 °C is the preferred temperature that should be used when using this set of
Listeria spp. specific primers.

EXAMPLES 
GENERAL METHODS 
Procedures for DNA amplifications and other protocols common in the art
of molecular biology used in the following examples may be found in Sambrook, J.
et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, NY (1989).

Materials and methods suitable for the maintenance and growth of bacterial
cultures are well known in the art. Techniques suitable for use in the following
examples may be found in Manual of Methods for General Bacteriology (Phillipp
Gerhardt, R. G. E. Murray, Ralph N. Costilow, Eugene W. Nester, Willis A.
Wood, Noel R. Krieg and G. Briggs Phillips, eds), American Society for
Microbiology, Washington, DC (1994) or in the work of Thomas D. Brock (in
Biotechnology: A Textbook of Industrial Microbiology, Second Edition (1989)
Sinauer Associates, Inc., Sunderland, MA). All reagents and materials used for
the growth and maintenance of bacterial cells were obtained from Aldrich
Chemicals (Milwaukee, WI), DIFCO Laboratories (Detroit, MI), GIBCO/BRL
(Gaithersburg, MD), or Sigma Chemical Company (St. Louis, MO) unless
otherwise specified.

The meaning of abbreviations is as follows: "h" means hour(s), "min"
means minute(s), "sec" means second(s), "d" means day(s), "mL" means milliliters,
"L" means liters.

DNA sequencing was performed by Lark Sequencing Technologies Inc.
(Houston, Texas) or according to the method of Sanger et al. (Proc. Natl. Acad.
Sci., USA 74, 5463, (1977)) using fluorescence-labeled dideoxynucleotides and
the Genesis 2000™ DNA Analysis System (E. I. du Pont de Nemours and
Company, Wilmington, DE).

Construction of Positive and Negative Test Panels
Listeria monocytogenes

A positive test panel consisting of 20 genotypically different Listeria
monocytogenes strains was constructed for the identification of a Listeria
monocytogenes and Listeria spp. RAPD marker. The Listeria monocytogenes
strains of the positive test panel encompassed all commonly encountered serotypes
and included L. monocytogenes DP #647, #899, #1324 and #3386.

The negative test panel consisted of 25 different Listeria non-monocytogenes
strains comprising the species L. innocua, L. seeligeri, L. welshimeri,
L. grayi, L. murrayi and L. ivanovii.

RAPD Primers

RAPD primers used for amplification of genomic DNA from the positive 
and negative test panels are given below in Table 1 

TABLE 1

Twelve-Base Arbitrary Primers Used in the Generation of RAPD Patterns

Primer   Sequence          SEQ ID NO       Primer   Sequence          SEQ ID NO
12CN01   AGC TCA TGC TAC   SEQ ID NO:1     12CN09   AAC CTC GTG TAG   SEQ ID NO:9
12CN02   AGT CGA ACT GTC   SEQ ID NO:2     12CN10   CAT TCG GGT ACA   SEQ ID NO:10
12CN03   TTA GTC ACG GCA   SEQ ID NO:3     12CN11   GCC CTT AGT GAA   SEQ ID NO:11
12CN04   TGC GAT ACC GTA   SEQ ID NO:4     12CN12   GCA GTT ATG ACC   SEQ ID NO:12
12CN05   CTA CAG CTG ATG   SEQ ID NO:5     12CN13   CCA GCT ATC TCT   SEQ ID NO:13
12CN06   GTC AGT CGA ACT   SEQ ID NO:6     12CN14   AGA AGG CAG TTG   SEQ ID NO:14
12CN07   GGC ATT AGT CAC   SEQ ID NO:7     12CN15   GGA CAG AGC ATA   SEQ ID NO:15
12CN08   CGT ATG CGA TAC   SEQ ID NO:8     12CN16   CGT TTC GCT TCA   SEQ ID NO:16

Primer Nomenclature

Primer names identified in the following examples are derived in the
following manner: The first number, 1515, indicates the primer sequence comes
from the RAPD fragment primed at both ends by 12CN15. The second number
indicates the primer length. The third number identifies the 3' base position of the
primer, where position 1 is at the 5' end of the original putative 12CN15 priming
site. The rc designation means that the priming site is located on the
complementary strand.

Primer locations may also be referred to by the last number of the
identifying number. Thus "1515-26-85" may be referred to as "85".

RAPD primers may be referred to without the "12" designation. Thus
"12CN15" may also be referred to as "CN15".

The primers described below in Table 2 are derived from the pre-marker
sequence. As single primers they match the pre-marker sequence and they also
match at least the last 9 bases of 3' sequence at another location outside of the
pre-marker sequence. The net effect is that these primers will generate amplification
products in single primer reactions from genomic DNA. These products contain
part of the original 414 bp fragment plus additional sequence.
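
The naming convention set out under Primer Nomenclature can be captured in a small parser. This is an illustrative sketch: the class and function names are hypothetical, and the parenthetical qualifier that appears in some primer names (for example "(8585)") is not defined in the nomenclature rules above, so it is assumed here to be an opaque label and is stored without interpretation.

```python
import re
from dataclasses import dataclass
from typing import Optional

# fragment[(qualifier)]-length-[rc]position, e.g. "1515(8585)-27-rc737"
_NAME = re.compile(
    r"^(?P<fragment>\d+)(?:\((?P<qualifier>[^)]*)\))?"
    r"-(?P<length>\d+)-(?P<rc>rc)?(?P<position>\d+)$"
)


@dataclass(frozen=True)
class PrimerName:
    fragment: str                   # e.g. "1515": fragment primed at both ends by 12CN15
    qualifier: Optional[str]        # opaque parenthetical label, if any (assumption)
    length: int                     # primer length in bases
    position: int                   # 3' base position of the primer
    on_complementary_strand: bool   # True when the name carries "rc"


def parse_primer_name(name: str) -> PrimerName:
    """Split a primer name into the fields defined by the nomenclature."""
    match = _NAME.match(name.strip())
    if match is None:
        raise ValueError(f"unrecognized primer name: {name!r}")
    return PrimerName(
        fragment=match["fragment"],
        qualifier=match["qualifier"],
        length=int(match["length"]),
        position=int(match["position"]),
        on_complementary_strand=match["rc"] is not None,
    )


print(parse_primer_name("1515-26-rc292"))
```
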



TABLE 2

SPECIFICITY   I.D.            SEQUENCE                              SEQ ID NO
Pre-marker    1515-26-85      TGC TGT TTG GTT TGC TCT AGC CCA GTG   SEQ ID NO:17
              1515-26-rc292   CAA CTT TCC ACA TGG CGC GAT TAT TTG   SEQ ID NO:18
              1515-26-rc341   GGG GAA CTG CCG AAG ATC GTA CAG CA    SEQ ID NO:19



EXAMPLE 1

ISOLATION OF PRE-MARKER SEQUENCE FROM LISTERIA
MONOCYTOGENES BY RAPD ANALYSIS
Example 1 details the isolation of the 414 bp pre-marker sequence from
Listeria monocytogenes genomic DNA using Random Amplified Polymorphic
DNA (RAPD) primer analysis.
A set of eight 12-base primers (Table 1) was used in a RAPD analysis of
20 strains of Listeria monocytogenes. The results of these amplifications were
examined for a Listeria monocytogenes-specific amplification product that could
be easily separated from other RAPD products.






The amplification and data acquisition protocol for each 12-base primer
RAPD reaction was as follows:

Amplification protocol: For each 50 µL reaction, 1.5 µL dNTP mix
(5 mM dNTP each), 36.3 µL deionized water, 5 µL 10X reaction buffer
(500 mM KCl, 100 mM tris @ pH 8.3, 15 mM MgCl2, 0.003% gelatin), 5 µL
primer (10 mM), 0.4 µL Taq polymerase (5 U/µL), and 1.2 µL Taq dilution
buffer (10 mM tris @ pH 8.0 and 1.0% Tween 20) were combined. A 1.0 µL
aliquot of genomic bacterial DNA @ 50 ng/µL was added to each mixture. The
reaction was heated to 94 °C for 2 min. Twenty-eight cycles of the following
temperature profile were run: 15" @ 94 °C, 5' @ 46 °C, 2' ramp to 72 °C, and 1'
@ 72 °C. At the end of cycling the reaction was incubated at 72 °C for 7 min.

After amplification a 5 µL aliquot of the reaction was combined with 2 µL of
Ficoll-loading buffer and run on a 4% acrylamide gel (29:1)/1.0x TBE. Following
electrophoresis, the gels were stained with ethidium bromide. The stained gels
were placed on a transilluminator and electronically imaged with a high sensitivity
CCD camera. Images were stored in computer memory for subsequent analysis.
Analysis: The primer 12CN15 (SEQ ID NO:15) was found to produce a 414 bp
(SEQ ID NO:32) amplification product in all L. monocytogenes strains. This
414 bp fragment was not observed in the amplification products of the negative
test panel using the same primer. Examples of the 12CN15 RAPD patterns for
strains of both test panels are shown in Figure 1. The lanes of Figure 1 are
identified in Table 3:



TABLE 3

Lane   Strain                    Lane   Strain
1      L. welshimeri #943        13     L. ivanovii #1165
2      L. monocytogenes #945     14     L. monocytogenes #1281
3      L. seeligeri #949         15     L. monocytogenes #1287
4      L. innocua #950           16     L. seeligeri #1290
5      L. monocytogenes #1047    17     L. monocytogenes #1283
6      L. seeligeri #1059        18     L. monocytogenes #1295
7      L. seeligeri #1061        19     L. monocytogenes #1299
8      L. monocytogenes #1068    20     L. seeligeri #1303
9      L. monocytogenes #1069    21     L. monocytogenes #1313
10     L. monocytogenes #1070    22     L. seeligeri #1317
11     L. innocua #1089          23     L. monocytogenes #1324
12     L. innocua #1157          24     L. monocytogenes #1963



* Arrows denote 414 bp product in L. monocytogenes strains. (See Lanes 2, 5, 8, 10, 14, 15, 18,
19, 21 and 24 of Figure 1.)

As is evident from Figure 1, the positive test panel produced a
characteristic amplification product of 414 bp which appeared in all of the 20
Listeria monocytogenes strains tested. Additionally, it is seen that none of the
negative test panel group showed the 414 bp amplification product seen in the
positive test panel.

EXAMPLE 2

GENERATION OF THE DIAGNOSTIC FRAGMENT FROM SEQUENCE
FLANKING THE PRE-MARKER SEQUENCE
Example 2 illustrates the sequencing of the flanking regions of the 414 bp
pre-marker sequence using single primers and the generation of the diagnostic
fragment.

The 414 bp pre-marker sequence of Example 1 was commercially
sequenced from two strains of Listeria monocytogenes (#647 and #1324)
described above. The pre-marker sequence for each strain is given in SEQ ID
NOS:20 and 21 for #647 and SEQ ID NOS:22 and 23 for #1324.

Primers were designed based on the pre-marker sequence, and evaluated
for their ability to specifically amplify from Listeria monocytogenes genomic
DNA. Initial primer sequences were 26 bases long with a GC composition of 50 ±
5% to allow for an annealing temperature in the range of 70 °C. Priming sites
were selected within a distance of 200 bases from each 12CN15 priming site.
Following this method, three single primers, identified as 1515-26-85 (SEQ ID
NO:17), 1515-26-rc292 (SEQ ID NO:18), and 1515-26-rc341 (SEQ ID
NO:19) (Table 2), were found to generate a PCR product in the absence of a second
primer.

Each of these single primers generated a PCR product that contained part
of the original pre-marker sequence plus additional sequence. Sequencing of these
augmented 12CN15 fragments was accomplished using the chain-termination
method of Sanger et al. (Proc. Natl. Acad. Sci., USA 74, 5463, (1977)) using
fluorescence-labeled dideoxynucleotides and the Genesis 2000™ DNA Analysis
System (E. I. du Pont de Nemours and Company, Wilmington, DE).

The new sequence consisted of 400 bases in the region preceding the
original 12CN15 site and 500 bases in the region following the second 12CN15
site. This sequence data made it possible to identify a complete ORF plus several
hundred bases upstream and downstream from the reading frame.

To further characterize the degree of conservation within this genetic locus,
nucleic acid sequences were determined for two additional strains of
L. monocytogenes, DP #899 and DP #3386. DP #899 is an additional
representative of a known pathogenic L. monocytogenes ribotype group.
DP #3386 is a strain of a less pathogenic L. monocytogenes ribotype group.
Sequencing of these strains was accomplished using the chain-termination method
of Sanger et al. (supra) and the Genesis 2000™ DNA Analysis System. The
complete sequences for the L. monocytogenes strains, DP #647, #899, #1324 and
#3386, are shown in SEQ ID NOS:24-31. (The 5' end of the original CN15
priming site was designated as base number 1 to provide a common reference
point for all of the sequence data.)

Analysis of the nucleic acid sequence showed that all four strains contained a
complete open reading frame of 855 bases. Within this ORF DP #647, #899, and
#3386 all showed identical nucleic acid sequences. DP #1324 is 97% homologous
with the other three L. monocytogenes strains. When the nucleic acid sequences
were translated into the corresponding amino acid sequence DP #1324 was found
to be identical to the other three strains. The amino acid composition for all of the
L. monocytogenes strains is shown in SEQ ID NO:32.
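
The observation that nucleotide sequences can differ while encoding an identical amino acid sequence follows from synonymous codons, and can be shown with a short translation sketch. The code below uses the standard genetic code; the two 12-base sequences are hypothetical toy examples and are not taken from SEQ ID NOS:24-31.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical sequences differing at three positions, all synonymous changes.
strain_a = "ATGGCTCTGAAA"
strain_b = "ATGGCCTTAAAA"
print(translate(strain_a), translate(strain_b))
```

Both toy sequences translate to the same peptide even though they are not identical at the nucleotide level, which is the pattern reported above for DP #1324 relative to the other three strains.
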

The promoter and the terminal spacer sequences for DP #647, #899, and
#3386 were also identical. DP #1324 shows a homology of 98% and 96% with
the promoter and the terminal spacer, respectively. This genetic locus and the
surrounding region showed a high level of conservation among strains of
L. monocytogenes.

Further analysis of the sequence composition of the 1300 bp diagnostic
fragment containing the 855 bp ORF was needed to determine if it was sufficiently
distinct from other L. spp. so as to provide selective priming sites for a PCR-based
assay for L. monocytogenes.

EXAMPLE 3

DETERMINATION OF SEQUENCE COMPOSITION OF DIAGNOSTIC
FRAGMENT BY COMPARISON OF MARKER FRAGMENT SEQUENCE
WITH OTHER LISTERIA SPECIES
A preliminary evaluation of primer sites in the augmented CN15 marker
demonstrated that many locations did not discriminate between L. monocytogenes
and other L. spp. This observation suggested that much of the sequence
composition of this genetic locus was conserved at the genus level. To determine
whether this genetic locus contained any sequences that were unique to
L. monocytogenes, strains representing L. innocua, L. seeligeri, L. welshimeri
and L. ivanovii were also selected for sequencing. (Marker sequences were not
determined for L. grayi and L. murrayi because these species were considerably
more polymorphic than other L. spp.) Previously determined non-selective
priming sites were used to generate quantities of DNA suitable for sequencing. As
with L. monocytogenes, primers 1515-26-85 (SEQ ID NO:17), 1515-26-rc292
(SEQ ID NO:18), and 1515-26-rc341 (SEQ ID NO:19), used in single primer
amplifications, also generated PCR products suitable for sequencing. Sequencing
was accomplished using the chain-termination method of Sanger et al. and the
Genesis 2000™ DNA Analysis System.
A comparison of the nucleotide sequences showed that each of the above species
contained an 855 base open reading frame. However, there were some
species-dependent differences in the nucleic acid composition within the ORF.
Sequences SEQ ID NOS: 33-40 show the nucleic acid composition for the
following strains: L. innocua DP #4450, L. seeligeri DP #3327, L. welshimeri
DP #3359, and L. ivanovii DP #3340, respectively. These differences in
nucleotide sequence are responsible for corresponding species variations in
amino acid sequence. The nucleotide and amino acid sequence homology is
summarized in Table 4. Figure 2 shows a comparison of the amino acid sequences
of each L. spp., including L. monocytogenes.

TABLE 4

Open Reading Frame Nucleotide and Amino Acid Sequence Comparison between
Listeria Species

                  Nucleotide Homology to     Amino Acid Homology to
Species           L. monocytogenes #647      L. monocytogenes #647

L. innocua        86.7%                      95.8%
L. seeligeri      84.5%                      94.0%
L. welshimeri     83.8%                      93.7%
L. ivanovii       83.3%                      93.3%

Of the L. spp. that were tested, L. innocua clearly showed the greatest
similarity to L. monocytogenes. The remaining three species all showed roughly
comparable levels of divergence in nucleotide and amino acid composition.
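The homology figures reported here are pairwise percentages. The text does not state how they were computed, so the following Python sketch assumes the simplest reading: ungapped, position-by-position identity between two sequences of equal length. The function name `percent_identity` is illustrative. The two inputs are the first 60 bases of SEQ ID NO: 20 (strain #647) and SEQ ID NO: 22 (strain #1324) as they appear in the sequence listing.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Ungapped identity (%) between two pre-aligned, equal-length sequences.

    Assumption: homology is taken as simple position-by-position identity.
    The source does not say which comparison method was actually used.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must be non-empty")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


# First 60 bases of SEQ ID NO: 20 (L. monocytogenes #647 premarker).
SEQ20_HEAD = "".join([
    "GGACAGAGCA", "TAGTTGGATG", "GAAACAATCC",
    "GATCAGCGGG", "AACGTTTTTG", "ATAGATTCTG",
])

# First 60 bases of SEQ ID NO: 22 (L. monocytogenes #1324 premarker).
SEQ22_HEAD = "".join([
    "GGACAGAGCA", "TAGTTGGATA", "GAAACAATCC",
    "GATCAGCGGG", "AACATTTTTG", "ATAGATTCAG",
])

if __name__ == "__main__":
    # Three of the 60 positions differ in this window.
    print(f"{percent_identity(SEQ20_HEAD, SEQ22_HEAD):.1f}%")
```

Applied to full-length aligned open reading frames, the same calculation would yield values of the kind listed in Table 4. Sequences containing insertions or deletions would first need a gapped alignment, which this sketch does not attempt.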

The promoter and terminal spacer nucleic acid sequences also showed
significant differences between species. The homologies of promoter and spacer
sequences compared to L. monocytogenes are summarized in Table 5.

TABLE 5

Promoter and Terminal Spacer Sequence Comparison between L. spp.

                  Promoter Homology to       Terminal Spacer Homology to
Species           L. monocytogenes #647      L. monocytogenes #647

L. innocua        91.9%                      79.4%
L. seeligeri      85.0%                      75.4%
L. welshimeri     90.9%                      75.5%
L. ivanovii       83.7%                      84.0%

The homology to L. monocytogenes found in promoter and terminal spacer
sequences is significantly different from that observed for the open reading
frame sequences. L. innocua and L. welshimeri both show homologies to
L. monocytogenes in the promoter region that are significantly greater than the
corresponding ORF homology. For L. seeligeri and L. ivanovii the promoter
regions and ORF regions both show comparable levels of homology with
L. monocytogenes. In three species the terminal spacer sequences show a notable
decrease in homology compared to the promoter regions and open reading frames,
while in L. ivanovii similar levels of homology to L. monocytogenes are found
in all three regions.

EXAMPLE 4

SELECTION OF LISTERIA MONOCYTOGENES DIAGNOSTIC PRIMERS ON
THE BASIS OF SENSITIVITY TO ANNEALING TEMPERATURE

Identification of priming sites and primer synthesis:
A comparison of sequence data for the five L. spp. (Example 3) made it
possible to identify seven priming sites that were unique to L. monocytogenes.
These priming sites were as follows: 1515(rc341x2)-26-363,
1515(rc341x2)-27-281, 1515-26-36, 1515-27-357, 1515-26-rc233,
1515(8585)-27-rc737, and 1515(8585)-28-rc793 (Figure 3). "Uniqueness" is
defined as any sequence region of 26-30 bases where the homology between
L. monocytogenes and any other Listeria spp. is less than 90%.
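The uniqueness criterion can be read as a sliding-window scan over aligned sequences. The Python sketch below is one possible reading, not the procedure actually used: it assumes the sequences are already aligned to equal length, that homology means ungapped identity, and that a window qualifies only if it falls below the threshold against every other species. The function names and the toy sequences are hypothetical and are not taken from the sequence listing.

```python
from typing import List, Sequence


def percent_identity(a: str, b: str) -> float:
    """Ungapped identity (%) between two equal-length strings."""
    if len(a) != len(b) or not a:
        raise ValueError("inputs must be non-empty and of equal length")
    return 100.0 * sum(x == y for x, y in zip(a, b)) / len(a)


def unique_window_starts(target: str,
                         others: Sequence[str],
                         window: int = 26,
                         threshold: float = 90.0) -> List[int]:
    """Return 0-based start offsets of windows in `target` whose identity
    to EVERY sequence in `others` is below `threshold` percent."""
    if any(len(o) != len(target) for o in others):
        raise ValueError("all sequences must be aligned to the same length")
    starts = []
    for start in range(len(target) - window + 1):
        segment = target[start:start + window]
        if all(percent_identity(segment, o[start:start + window]) < threshold
               for o in others):
            starts.append(start)
    return starts


# Hypothetical toy alignment (NOT real Listeria data): the first 30 positions
# are conserved; the remainder diverges to different extents.
TARGET = "A" * 30 + "C" * 30
OTHERS = [
    "A" * 30 + "G" * 30,             # diverges from position 30 onward
    "A" * 30 + "C" * 10 + "T" * 20,  # diverges from position 40 onward
]

if __name__ == "__main__":
    # A short window is used only to keep the toy example small.
    starts = unique_window_starts(TARGET, OTHERS, window=10)
    print(starts[0], starts[-1])
```

Requiring the window to fall below the threshold for all other species is the stricter interpretation of "any other Listeria spp."; a site that is distinct from only one species would not be useful for a species-selective primer.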

Primers were synthesized to correspond to these sites and ranged in length
from 26 to 30 bases depending on the GC composition. Longer primers were used
for sites with a GC composition of less than 50% to assure that a 70 °C
annealing temperature could be maintained.
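The link between primer length, GC composition and a workable annealing temperature can be illustrated with a rough melting-temperature estimate. The source does not say how melting temperatures were assessed, so the Python sketch below is an assumption: it uses a common length-corrected GC approximation for oligonucleotides longer than 13 bases, Tm = 64.9 + 41 x (G + C - 16.4) / N. The example inputs are the three sequencing primers given as SEQ ID NOS: 17-19 in the sequence listing, not the diagnostic primers of this example, and the estimates are not measured values.

```python
def gc_count(primer: str) -> int:
    """Number of G and C bases in a primer."""
    p = primer.upper()
    return p.count("G") + p.count("C")


def estimate_tm_celsius(primer: str) -> float:
    """Rough Tm estimate from the length-corrected GC formula.

    Assumption: Tm = 64.9 + 41 * (G + C - 16.4) / N, an approximation
    intended for oligonucleotides of 14 bases or more. It ignores salt and
    primer concentration, so it is only a guide.
    """
    n = len(primer)
    if n < 14:
        raise ValueError("formula is intended for primers of 14+ bases")
    return 64.9 + 41.0 * (gc_count(primer) - 16.4) / n


# Primers as listed in the sequence listing (SEQ ID NOS: 17-19).
PRIMERS = {
    "SEQ ID NO: 17": "TGCTGTTTGGTTTGCTCTAGCCCAGTG",
    "SEQ ID NO: 18": "CAACTTTCCACATGGCGCGATTATTTG",
    "SEQ ID NO: 19": "GGGGAACTGCCGAAGATCGTACAGCA",
}

if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        gc_pct = 100.0 * gc_count(seq) / len(seq)
        print(f"{name}: {len(seq)} nt, GC {gc_pct:.1f}%, "
              f"Tm ~ {estimate_tm_celsius(seq):.1f} C")
```

Under this approximation, adding bases to a GC-poor primer raises the estimate, which is consistent with the choice to lengthen primers at sites with less than 50% GC. An estimated Tm is not the same thing as an optimal annealing temperature, so empirical titration of the kind described in this example is still required.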

Figure 3 shows the primer sequences and a comparison of priming site
sequences for strains representing the following species: L. monocytogenes,
L. innocua, L. seeligeri, L. welshimeri and L. ivanovii.

Determination of annealing temperature:

To minimize the likelihood that L. monocytogenes-selective primers would
generate PCR products from mismatched priming in non-monocytogenes strains,
the relationship between annealing temperature and selectivity was evaluated. A
33-strain test panel consisting of 15 L. innocua, 5 L. welshimeri,
6 L. seeligeri, 3 L. ivanovii, 1 L. grayi and 3 L. monocytogenes was prepared.
Amplifications were first carried out at an annealing temperature of 70 °C.
Annealing temperature was then decreased in 5 °C increments until amplification
products began to appear for the non-monocytogenes strains. The amplification
and detection protocol is summarized below.

Combine 1.5 µL dNTP mix (5 mM each dNTP), 40 µL deionized water, 5 µL 10X
reaction buffer (500 mM KCl, 100 mM Tris pH 8.3, 15 mM MgCl2, 0.003% gelatin),
0.4 µL Taq polymerase (5 U/µL), 1.2 µL Taq dilution buffer (10 mM Tris pH 8.0
and 1.0% Tween 20), 0.66 µL each primer (10 µM), and 1.0 µL genomic DNA at
50 ng/µL. Heat to 94 °C for 2 min. Thirty-five cycles of the following
temperature profile were run: 15 s at 94 °C, 2 min at the specified annealing
temperature, and 1 min at 72 °C. At the conclusion of cycling the reaction was
incubated at 72 °C for 7 min.
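The reaction recipe lends itself to a quick arithmetic check of total volume and final concentrations. The Python sketch below assumes the component volumes as read from the protocol above (1.5 µL dNTP mix, 40 µL water, 5 µL 10X buffer, 0.4 µL Taq, 1.2 µL dilution buffer, 0.66 µL of each of two primers, 1.0 µL genomic DNA). The class and function names, the 10% overage and the choice to add template separately are illustrative assumptions, not part of the source protocol.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Sequence


@dataclass(frozen=True)
class Component:
    name: str
    volume_ul: float  # volume per single reaction, in microliters


# Per-reaction volumes, assuming the readings given in the lead-in.
REACTION = [
    Component("dNTP mix (5 mM each)", 1.5),
    Component("deionized water", 40.0),
    Component("10X reaction buffer", 5.0),
    Component("Taq polymerase (5 U/uL)", 0.4),
    Component("Taq dilution buffer", 1.2),
    Component("primer 1 (10 uM)", 0.66),
    Component("primer 2 (10 uM)", 0.66),
    Component("genomic DNA (50 ng/uL)", 1.0),
]


def total_volume_ul(components: Iterable[Component]) -> float:
    """Sum of all component volumes for one reaction."""
    return sum(c.volume_ul for c in components)


def final_concentration(stock: float, volume_ul: float, total_ul: float) -> float:
    """Dilute a stock concentration into the final reaction volume.

    The result carries the same unit as `stock`.
    """
    return stock * volume_ul / total_ul


def master_mix(components: Sequence[Component],
               n_reactions: int,
               overage: float = 0.10,
               add_separately: Sequence[str] = ("genomic DNA (50 ng/uL)",)
               ) -> Dict[str, float]:
    """Scale shared components for n reactions plus a pipetting overage.

    Template DNA is excluded because it differs for each strain tested.
    """
    factor = n_reactions * (1.0 + overage)
    return {c.name: c.volume_ul * factor
            for c in components if c.name not in add_separately}


if __name__ == "__main__":
    total = total_volume_ul(REACTION)
    print(f"total volume: {total:.2f} uL")
    print(f"each primer: {final_concentration(10.0, 0.66, total):.3f} uM")
    print(f"each dNTP:   {final_concentration(5.0, 1.5, total):.3f} mM")
    for name, vol in master_mix(REACTION, n_reactions=33).items():
        print(f"{name}: {vol:.1f} uL")
```

The 33 reactions in the usage example correspond to one pass over the 33-strain test panel at a single annealing temperature.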

Visualization of amplification products is the same as described above.
Table 6 shows a comparison of the onset of false positive responses as the
annealing temperature is reduced.

TABLE 6

Percent False Positive L. monocytogenes Responses as a Function of
Annealing Temperature

                                                  Annealing Temperature, °C
Primer Sets                                       70°    65°    60°    55°

1515-27-357/1515(85,85)-27-rc737                  0%     0%     5%     ND
1515-27-357/1515(85,85)-28-rc793                  0%     0%     55%    ND
1515-26-36/1515-26-rc233                          0%     5%     27%    ND
1515(rc341,rc341)-27-(-281)/1515-26-rc233         0%     0%     0%     1.5%
1515(rc341,rc341)-26-(-363)/1515-26-rc233         0%     0%     0%     33%

* Primer label numbers contained in parentheses, i.e., (85, 85) and
(rc341, rc341), indicate that these primers were derived from fragments
generated by the single primers, 1515-26-85 and 1515-26-rc341.

Primer pairs that produced no false positives at 5 °C below the standard
annealing temperature of 70 °C were considered as candidates for a PCR-based
L. monocytogenes assay. Of all the primer sets tested, only
1515-26-36/1515-26-rc233 showed an unacceptable onset of false positive
responses at 65 °C.
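The selection rule stated here (no false positives at 70 °C or at 5 °C below it) can be written as a simple filter over the Table 6 data. The Python sketch below transcribes the table values; `None` stands for "ND" (not determined). The dictionary layout and the function name are illustrative choices.

```python
from typing import Dict, List, Optional

# Percent false positives by annealing temperature (deg C), from Table 6.
FALSE_POSITIVE_PCT: Dict[str, Dict[int, Optional[float]]] = {
    "1515-27-357/1515(85,85)-27-rc737":
        {70: 0.0, 65: 0.0, 60: 5.0, 55: None},
    "1515-27-357/1515(85,85)-28-rc793":
        {70: 0.0, 65: 0.0, 60: 55.0, 55: None},
    "1515-26-36/1515-26-rc233":
        {70: 0.0, 65: 5.0, 60: 27.0, 55: None},
    "1515(rc341,rc341)-27-(-281)/1515-26-rc233":
        {70: 0.0, 65: 0.0, 60: 0.0, 55: 1.5},
    "1515(rc341,rc341)-26-(-363)/1515-26-rc233":
        {70: 0.0, 65: 0.0, 60: 0.0, 55: 33.0},
}


def candidate_primer_sets(table: Dict[str, Dict[int, Optional[float]]],
                          standard_temp: int = 70,
                          margin: int = 5) -> List[str]:
    """Keep primer sets with 0% false positives at the standard annealing
    temperature and at `margin` degrees below it."""
    check_temp = standard_temp - margin
    return [
        name for name, by_temp in table.items()
        if by_temp.get(standard_temp) == 0.0 and by_temp.get(check_temp) == 0.0
    ]


if __name__ == "__main__":
    for name in candidate_primer_sets(FALSE_POSITIVE_PCT):
        print(name)
```

Treating a missing or undetermined value as a failure is a conservative choice: a primer set is only retained when both temperatures were actually tested and showed no false positives.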

Figures 4A-4C are gels illustrating the appearance of anomalous
amplification products as the annealing temperature was reduced for the
1515-26-36/1515-26-rc233 primer pair. The gel lanes are identified as follows
in Table 7.

TABLE 7

Lane  Strain                      Lane  Strain

1     L. seeligeri #2874          17    L. innocua #3429
B     BLANK                       18    L. ivanovii #3357
2     L. innocua #2921            19    L. welshimeri #3558
3     L. ivanovii #3072           20    L. innocua #3571
4     L. innocua #3241            21    L. innocua #3797
5     L. innocua #3244            22    L. seeligeri #3828
6     L. seeligeri #3327          23    L. innocua #4094
7     L. ivanovii #3340           24    L. innocua #4101
8     L. innocua #3352            25    L. innocua #4323
9     L. welshimeri #3354         B     BLANK
10    L. grayi #3356              26    L. seeligeri #4333
11    L. welshimeri #3359         27    L. innocua #4442
12    L. seeligeri #3371          28    L. innocua #4450
13    L. seeligeri #3374          29    L. innocua #4452
B     BLANK                       30    L. innocua #4463
14    L. welshimeri #3411         31    L. monocytogenes #3847
15    L. welshimeri #3412         32    L. monocytogenes #4324
16    L. innocua #3420            33    L. monocytogenes #4341



EXAMPLE 5

FINAL EVALUATION OF LISTERIA MONOCYTOGENES
DIAGNOSTIC PRIMERS

Example 5 illustrates the inclusiveness of the identified Listeria
monocytogenes diagnostic primers for all strains of Listeria monocytogenes.

The accuracy of all candidate primer sets was evaluated for a larger group of
L. monocytogenes strains. A set of 323 strains of L. monocytogenes was used to
evaluate the inclusivity of three primer sets. PCR assay conditions were the
same as those specified above except that only a 70 °C annealing temperature
was used. In PCR-based assays using these three primer sets, the accuracy was
as follows:

1) 1515-27-357/1515(8585)-27-rc737, 100%;

2) 1515-27-357/1515(8585)-28-rc793, 99.2%; and

3) 1515(rc341x2)-26-363/1515-26-rc233, 99.5%.

Although the -281/rc233 primer set was not tested, its inclusivity response was
expected to be comparable to 1515(rc341x2)-26-363/1515-26-rc233. None of the
three primer sets generated amplification products for the 30
non-monocytogenes strains in the annealing stringency test panel. Examples of
the positive and negative test panel results for the
1515-27-357/1515(8585)-27-rc737 primer pair are shown in Figures 5A and 5B,
respectively. Figures 5A and 5B each show a gel analysis which corresponds to
the strains listed in Table 8.

TABLE 8

A: Positive test panel response for PCR products generated from primer set
   1515-27-357/1515(8585)-27-rc737
B: Negative test panel response for PCR products generated from primer set
   1515-27-357/1515(8585)-27-rc737

      A                                   B
Lane  Strain                        Lane  Strain

1     L. monocytogenes #652         1     L. seeligeri #2874
2     L. monocytogenes #1049        B     BLANK
3     L. monocytogenes #936         2     L. innocua #2921
4     L. monocytogenes #954         3     L. ivanovii #3072
5     L. monocytogenes #957         4     L. innocua #3241
6     L. monocytogenes #952         5     L. innocua #3244
7     L. monocytogenes #946         6     L. seeligeri #3327
8     L. monocytogenes #955         7     L. ivanovii #3340
9     L. monocytogenes #937         8     L. innocua #3352
B     BLANK                         9     L. welshimeri #3354
10    L. monocytogenes #1051        10    L. grayi #3356
11    L. monocytogenes #1046        11    L. welshimeri #3359
12    L. monocytogenes #1067        12    L. seeligeri #3371
13    L. monocytogenes #[illegible] 13    L. seeligeri #3374
14    L. monocytogenes #1057        B     BLANK
15    L. monocytogenes #1087        14    L. welshimeri #3411
16    L. monocytogenes #1145        15    L. welshimeri #3412
17    L. monocytogenes #1146        16    L. innocua #3420
18    L. monocytogenes #1153        17    L. innocua #3429
B     BLANK                         18    L. ivanovii #3357
19    L. monocytogenes #1144        19    L. welshimeri #3558
20    L. monocytogenes #1322        20    L. innocua #3571
21    L. monocytogenes #1287        21    L. innocua #3797
22    L. monocytogenes #1316        22    L. seeligeri #3828
23    L. monocytogenes #1306        23    L. innocua #4094
24    L. monocytogenes #1298        24    L. innocua #4101
25    L. monocytogenes #1302        25    L. innocua #4323
26    L. monocytogenes #1285        B     BLANK
27    L. monocytogenes #1286        26    L. seeligeri #4333
B     BLANK                         27    L. innocua #4442
28    L. monocytogenes #1294        28    L. innocua #4450
29    L. monocytogenes #1283        29    L. innocua #4452
30    L. monocytogenes #1288        30    L. innocua #4463
31    L. monocytogenes #1284        31    L. monocytogenes #3847
32    L. monocytogenes #1282        32    L. monocytogenes #4324
33    L. monocytogenes #1464        33    L. monocytogenes #4341



EXAMPLE 6

SELECTIVITY TESTING FOR LISTERIA SPP. DIAGNOSTIC PRIMERS

Selection of Listeria spp. Diagnostic Primers

A comparison of sequence data for the five Listeria species made it possible
to identify priming sites that were at least 90% homologous to
L. monocytogenes. The following priming sites were selected as possible
candidates: 76, 88, 624, rc483, rc555, rc573, and rc824. Primers to these sites
were made 30 bases in length to compensate for mismatches in sequence and to
assure that a 70 °C annealing temperature could be maintained. Figure 6 shows
the primer sequences and a comparison of priming site sequences for strains
representing the following species: L. monocytogenes, L. innocua,
L. seeligeri, L. welshimeri and L. ivanovii. Primer pairs from this group were
first evaluated with a 33-strain test panel consisting of 15 L. innocua,
5 L. welshimeri, 6 L. seeligeri, 3 L. ivanovii, 1 L. grayi and
3 L. monocytogenes. Amplifications were first carried out at an annealing
temperature of 70 °C. The results of this evaluation are summarized in
Table 9.



TABLE 9

% Positive Response for PCR-Based Assay of L. spp.:
Effect of Priming Site Location

Priming Sites:   rc483            rc555            rc573            rc824

76               80%              100%             100%             0%
88               85%              100%             100%             Not Determined
624              Not Applicable   Not Applicable   Not Applicable   0%

The 76, 88, rc555 and rc573 priming sites all appear to be viable at a 70 °C
annealing temperature, based on the 100% positive response that is achieved
with any combination of these primers. When rc483 is used in conjunction with
76 or 88, the positive response drops to the 80-85% range. Decreasing the
annealing temperature to 65 °C increases the response to 97%. Although both
76/rc824 and 624/rc824 generated no PCR product with a 70 °C annealing
temperature, when the temperature was decreased to 65 °C the positive response
increased to 97%. In all of the tests at 65 °C the only false negative was a
single L. grayi strain. It appears that the melting temperatures of the rc824
and rc483 primers are too low to permit their effective use at 70 °C. No
further testing was done with these primers. The 76, 88, rc555 and rc573
priming sites all appear to be viable at a 70 °C annealing temperature.

L. grayi and L. murrayi are not as frequently encountered as other L. spp.
However, since no sequence data was determined for these species, additional
strains were tested with the primer sets that scored 100%. The test group
consisted of 7 strains of L. grayi and 4 strains of L. murrayi. The positive
response of the various primer sets was as follows:

76/rc555 50%;
76/rc573 18%;
88/rc555 27%; and
88/rc573 9%.

The weak and variable response of L. grayi and L. murrayi to the genus
level priming sites is not surprising. DNA-DNA hybridization studies have shown
that L. grayi and L. murrayi were only moderately related to reference strains
of L. monocytogenes, L. innocua, L. seeligeri, L. welshimeri and L. ivanovii,
i.e., at 3-29% and 1-9% respectively. (Rocourt et al., Curr. Microbiol.
7:383-388 (1982).) Based on this and other DNA-DNA hybridization studies, it
has been suggested that L. grayi and L. murrayi are sufficiently different from
L. monocytogenes to merit their separation into a new genus, Murraya grayi. The
issue of how these species should be classified is currently undecided.
Regardless of how strains from these two species are ultimately classified, the
genus level primers show a generally weak positive response to strains from
this group. The strength of the positive response is expected to be extremely
dependent on the level of DNA used in the assay, with sensitivity to L. grayi
and L. murrayi strains expected to run 3 orders of magnitude poorer than other
Listeria species.

EXAMPLE 7

SELECTION OF LISTERIA SPP. DIAGNOSTIC PRIMERS ON THE BASIS 
OF SENSITIVITY TO ANNEALING TEMPERATURE 
To minimize the likelihood that L. spp. primers would generate PCR
products from mismatched priming in non-Listeria strains, the relationship
between annealing temperature and amplification specificity was evaluated.
Amplifications were carried out on a test panel of 7 strains at an annealing
temperature of 70 °C. Annealing temperature was then decreased in 5 °C
increments until the onset of nonspecific amplification products. The primer
combinations of 76/rc573 and 88/rc573 both showed nonspecific products at
65 °C. At 60 °C 88/rc555 began to show nonspecific products. Such products
were not observed for the 76/rc555 primer set until the annealing temperature
was reduced to 55 °C. Since the 76/rc555 primer set was least likely to
generate nonspecific amplification products, this primer set was the primary
candidate for the Listeria spp. detection assay.
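The ranking described in this example amounts to choosing the primer pair whose nonspecific products first appear at the lowest annealing temperature. A minimal Python sketch, using the onset temperatures reported above, is shown below. The dictionary and function names are illustrative, and expressing the result as a margin below the 70 °C assay temperature is an added convenience rather than something the source reports.

```python
from typing import Dict, List, Tuple

# Annealing temperature (deg C) at which nonspecific products first appeared,
# as reported in this example.
NONSPECIFIC_ONSET_C: Dict[str, int] = {
    "76/rc573": 65,
    "88/rc573": 65,
    "88/rc555": 60,
    "76/rc555": 55,
}

ASSAY_TEMP_C = 70  # standard annealing temperature used in the assays


def rank_by_specificity_margin(onsets: Dict[str, int],
                               assay_temp: int = ASSAY_TEMP_C
                               ) -> List[Tuple[str, int]]:
    """Return (primer pair, margin) sorted from widest to narrowest margin,
    where margin = assay temperature minus onset of nonspecific products."""
    margins = [(pair, assay_temp - onset) for pair, onset in onsets.items()]
    return sorted(margins, key=lambda item: item[1], reverse=True)


def most_robust_pair(onsets: Dict[str, int]) -> str:
    """Primer pair with the lowest onset temperature for nonspecific products."""
    return min(onsets, key=onsets.get)


if __name__ == "__main__":
    for pair, margin in rank_by_specificity_margin(NONSPECIFIC_ONSET_C):
        print(f"{pair}: {margin} C below assay temperature")
    print("primary candidate:", most_robust_pair(NONSPECIFIC_ONSET_C))
```

A wider margin means the assay tolerates more thermal-cycler variation before nonspecific products appear, which is the practical reason for preferring the pair with the lowest onset temperature.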

EXAMPLE 8 

FINAL EVALUATION OF LISTERIA SPP. DIAGNOSTIC PRIMERS 
Final Evaluation of Listeria spp. Candidate Primer Set

Before the 76/rc555 primer selection was confirmed, the accuracy of this set
was evaluated for additional L. spp. strains. This entire test panel consisted
of 73 strains that were broken down by species as follows: 9 L. monocytogenes,
34 L. innocua, 5 L. welshimeri, 11 L. seeligeri, 3 L. ivanovii, 7 L. grayi and
4 L. murrayi. PCR conditions were the same as those specified above except that
only a 70 °C annealing temperature was used. All of the strains belonging to
the species L. monocytogenes, L. innocua, L. welshimeri, L. seeligeri, and
L. ivanovii tested at 100% positive for the 76/rc555 primer set. As reported
above, the test group of 7 L. grayi and 4 L. murrayi strains scored at 50%.

The accuracy of the 76/rc555 primer set was also evaluated on a non-Listeria
spp. test panel consisting of 65 strains that represent a variety of related
gram positive strains and enteric gram negative strains. The species tested are
summarized in Table 10.

TABLE 10

Negative Test Panel for Screening of Listeria spp. Primer Set
1515-30-76 / 1515(8585)-30-rc555

Species                      No. of Strains   Species                      No. of Strains

Aeromonas species            1                Salmonella enteritidis       3
Bacillus cereus              4                Salmonella redlands          1
Bacillus subtilis            2                Salmonella virchow           1
Bacillus thuringiensis       3                Salmonella Santiago          2
Carnobacterium piscicola     2                Staphylococcus aureus        4
Enterococcus casseliflavus   1                Staphylococcus carnosus      2
Enterococcus faecalis        10               Staphylococcus epidermidis   6
Enterococcus faecium         1                Staphylococcus species       1
Enterococcus species         1                Staphylococcus warneri       2
Escherichia coli             2                Staphylococcus xylosus       1
Lactococcus lactis           9                Brochothrix thermosphacta    8

None of the strains in Table 10 produced a positive PCR response for annealing
temperatures ranging from 60-70 °C. Examples of the results of Listeria spp.
and non-Listeria spp. test panels for the 76/rc555 primer pair are shown by gel
analysis in Figures 7A and 7B, respectively. The lanes are identified as
follows in Table 11.

TABLE 11

Listeria spp. positive test panel response for PCR products generated from
primer set 1515-30-76 / 1515(8585)-30-rc555 corresponding to Figure 4

Lane  Strain                      Lane  Strain

1     L. seeligeri #2874          17    L. innocua #3429
B     BLANK                       18    L. ivanovii #3357
2     L. innocua #2921            19    L. welshimeri #3558
3     L. ivanovii #3072           20    L. innocua #3571
4     L. innocua #3241            21    L. innocua #3797
5     L. innocua #3244            22    L. seeligeri #3828
6     L. seeligeri #3327          23    L. innocua #4094
7     L. ivanovii #3340           24    L. innocua #4101
8     L. innocua #3352            25    L. innocua #4323
9     L. welshimeri #3354         B     BLANK
10    L. grayi #3356              26    L. seeligeri #4333
11    L. welshimeri #3359         27    L. innocua #4442
12    L. seeligeri #3371          28    L. innocua #4450
13    L. seeligeri #3374          29    L. innocua #4452
B     BLANK                       30    L. innocua #4463
14    L. welshimeri #3411         31    L. monocytogenes #3847
15    L. welshimeri #3412         32    L. monocytogenes #4324
16    L. innocua #3420            33    L. monocytogenes #4341

TABLE 12

Listeria spp. negative test panel response for PCR products generated from
primer set 1515-30-76 / 1515(8585)-30-rc555

Lane Strain

1 Escherichia coli #642
2 Staphylococcus epidermidis #764
3 Staphylococcus epidermidis #783
4 Staphylococcus aureus #789
5 Staphylococcus epidermidis #796
6 Staphylococcus warneri #797
7 Staphylococcus warneri #799
8 Staphylococcus aureus #895
9 Carnobacterium piscicola #920
10 Staphylococcus aureus #923
BLANK
11 Bacillus subtilis #1011
12 Bacillus subtilis #1041
13 Staphylococcus carnosus #1090
14 Staphylococcus carnosus #1091
15 Staphylococcus xylosus #1120
16 Carnobacterium piscicola #1160
17 Bacillus thuringiensis #1221
18 Staphylococcus aureus #2095
19 Enterococcus faecalis #3074
20 Lactococcus lactis #3584
BLANK
21 Lactococcus lactis #3585
22 Lactococcus lactis #3586
23 Lactococcus lactis #3587
24 Lactococcus lactis #3588
25 Lactococcus lactis #3589
26 Lactococcus lactis #3590
27 Lactococcus lactis #3591
28 Escherichia coli #3803
29 Lactococcus lactis #3817
30 Aeromonas species #3818
BLANK
31 Enterococcus faecalis #3837
32 Enterococcus faecalis #3838
33 Enterococcus species #4095
34 Enterococcus faecium #4428
35 Salmonella redlands #4563
36 Salmonella enteritidis #4565
37 Salmonella enteritidis #4593
38 Bacillus thuringiensis #4941
39 Bacillus thuringiensis #5083
40 Enterococcus faecalis #5504
41 Salmonella virchow #5508
42 Enterococcus casseliflavus #5574
BLANK
43 Listeria monocytogenes #3844
44 Listeria monocytogenes #3278

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: KAZ £1, JAMES WILLIAM
               JENSEN, MARK ANTON

(ii) TITLE OF INVENTION: GENETIC MARKERS AND METHODS FOR
     THE DETECTION OF LISTERIA
     MONOCYTOGENES AND LISTERIA SPP.

(iii) NUMBER OF SEQUENCES: UO

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: E. I. DU PONT DE NEMOURS AND COMPANY
(B) STREET: 1007 MARKET STREET
(C) CITY: WILMINGTON
(D) STATE: DELAWARE
(E) COUNTRY: U.S.A.
(F) ZIP: 19898

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: 3.50 INCH DISKETTE
(B) COMPUTER: IBM PC COMPATIBLE
(C) OPERATING SYSTEM: MICROSOFT WINDOWS 3.1
(D) SOFTWARE: MICROSOFT WORD 2.0C

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 08/745,228
(B) FILING DATE: NOVEMBER 8, 1996

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: FLOYD, LINDA AXAMETHY
(B) REGISTRATION NO.: 33,692
(C) REFERENCE/DOCKET NUMBER: MD-1065-A

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 302-892-8112
(B) TELEFAX: 302-773-0164

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
AGCTGATGCT AC 12

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
AGTCGAACTG TC 12

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
TTAGTCACGG CA 12

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
TGCGATACCG TA 12

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
CTACAGCTGA TG 12

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
GTCAGTCGAA CT 12

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
GGCATTAGTC AC 12

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
CGTATGCGAT AC 12

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
AACCTCGTGT AG 12

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
CATTCGGGTA CA 12

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GCCCTTAGTG AA 12

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
GCAGTTATGA CC 12

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
CCAGCTATCT CT 12

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
AGAAGGCAGT

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
GGACAGAGCA TA 12

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
CGTTTCGCTT CA 12
(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
TGCTGTTTGG TTTGCTCTAG CCCAGTG 27

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
CAACTTTCCA CATGGCGCGA TTATTTG 27

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
GGGGAACTGC CGAAGATCGT ACAGCA 26



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 414 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vi) ORIGINAL SOURCE:
(B) STRAIN: L MONO - 647 - PREMARKER
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

GGACAGAGCA TAGTTGGATG GAAACAATCC GATCAGCGGG AACGTTTTTG ATAGATTCTG 60
CTGTTTGGTT TGCTCTAGCC CAGTGCCACG TGTCACAAAT AAGTTGCGCA TTATCTCGGC 120
CGCATGCTTC TGCTACTCGC CAAGCTGCTT GTAAGTCTGC TACACCGCTA TATGGCATAA 180
ATTCTAAACC AATAATTAAT TCTTCTGCGC GGTCACATAA TTCACCAAGC GCGACGATGA 240
TTTGTTCCTC AGGGATTTTT TCAAGCAAAC CACAATTAAT ATGTTTGACG CCAAATAATC 300
GCGCCATGTG GAAAGTTGTT TGCTCTTTCT TTTGTTGTTC TGCTGTACGA TCTTCGGCAG 360
TTCCCCACTG AGTTATGTAC TCCACTTCTG TTACTTTCAT GTTATGCTCT GTCC 414



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 414 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(iv) ANTI-SENSE: YES
(vi) ORIGINAL SOURCE:
(B) STRAIN: L MONO - 647 - PREMARKER
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

GGACAGAGCA TAACATGAAA GTAACAGAAG TGGAGTACAT AACTCAGTGG GGAACTGCCG 60
AAGATCGTAC AGCAGAACAA CAAAAGAAAG AGCAAACAAC TTTCCACATG GCGCGATTAT 120
TTGGCGTCAA ACATATTAAT TGTGGTTTGC TTGAAAAAAT CCCTGAGGAA CAAATCATCG 180
TCGCGCTTGG TGAATTATGT GACCGCGCAG AAGAATTAAT TATTGGTTTA GAATTTATGC 240
CATATAGCGG TGTAGCAGAC TTACAAGCAG CTTGGCGAGT AGCAGAAGCA TGCGGCCGAG 300
ATAATGCGCA ACTTATTTGT GACACGTGGC ACTGGGCTAG AGCAAACCAA ACAGCAGAAT 360
CTATCAAAAA CGTTCCCGCT GATCGGATTG TTTCCATCCA ACTATGCTCT GTCC 414

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 414 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(vi) ORIGINAL SOURCE:
(A) ORGANISM: L MONO - 1324 - PREMARKER
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

GGACAGAGCA TAGTTGGATA GAAACAATCC GATCAGCGGG AACATTTTTG ATAGATTCAG 60
CTGTTTGATT TGCTCTAGCC CAGTGCCATG TGTCACAAAT AAGTTGCGCG TTATCTCGTC 120
CGCATGCTTC TGCTACTCGC CAAGCTGCTT GTAAGTCTGC TACACCGCTA TATGGCATAA 180
ATTCTAAACC AATAATTAAT TCTTCTGCGC GGTCACATAA TTCACCAAGC GCGACAATGA 240
TTTGTTCCTC AGGGATTTTT TCAAGCAAAC CACAATTAAT ATGTTTGACG CCAAATAATC 300
GCGCCATGTG GAAAGTTGTT TGCTCTTTCT TTTGTTGTTC TGCTGTACGA TCTTCGGCAG 360
TTCCCCACTG GGTTATGTAC TCCACTTCTG TTACTTTCAT GTTATGCTCT GTCC 414
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 414 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

4 (iv v ANTI-SENSE: YES 

(vi) ORIGINAL SOURCE: 

(3) STRAIN: L MONO - 1324 - PREMARKER 



.1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GGACAGAGCA TAACATGAAA GTAACAGAAG TGGAGTACAT AACCCAGTGG GGAACTGCCG 60 

AAGATCGTAC AG CAGAAC AA CAAAAGAAAG AGCAAACAAC TTTCCACATG GCGCGATTAT 120 

TTGGCGTCAA ACATATTAAT TGTGGTTTGC TTGAAAAAAT CCCTGAGGAA CAAATCATTG 180 

TCGCGCTTGG TGAATTATGT GACCGCGCAG AAGAATTAAT TATTGGTTTA GAATTTATGC 240 
CATATAGCGG TGTAGCAGAC TTACAAGCAG CTTGGCGAGT AGCAGAAGCA TGCGGACGAG 300

ATAACGCGCA ACTTATTTGT GACACATGGC ACTGGGCTAG AGCAAATCAA ACAGCTGAAT 360

CTATCAAAAA TGTTCCCGCT GATCGGATTG TTTCTATCCA ACTATGCTCT GTCC 414
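
Entries flagged (iv) ANTI-SENSE: YES in this listing appear to be the reverse complement of the entry that precedes them. The following is a minimal Python sketch of that check, not part of the original listing. The function name `reverse_complement` is illustrative, and the example assumes only the unambiguous bases A, C, G and T. It compares the first 20 bases of SEQ ID NO: 22 with the last 20 bases of SEQ ID NO: 23 as printed above.

```python
# Complement table for unambiguous DNA bases
# (assumption: no IUPAC ambiguity codes are present).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string, ignoring whitespace."""
    cleaned = "".join(seq.split()).upper()
    return cleaned.translate(_COMPLEMENT)[::-1]


# First 20 bases of SEQ ID NO: 22 and last 20 bases of SEQ ID NO: 23.
sense_start = "GGACAGAGCA TAGTTGGATA"
antisense_end = "TATCCAACTATGCTCTGTCC"

# Prints whether the two fragments are reverse complements of each other.
print(reverse_complement(sense_start) == antisense_end)
```

The same comparison can be applied to whole sequences to spot transcription errors in either member of a sense/anti-sense pair.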

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1354 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:
(B) STRAIN: L MONO - 647 - D. FRAG

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:

TAAGCAAGGA AGTATCTGAT AAAGTCCATC TGTATTTGCA TAGTTGTTAC ACATTGGCGA 60 

TACGACAAAT CTGTTAGGCA CCTTCATCGG CCCGATATCA ATAGGTGAAA ACATCGAATT 120

AAATTTCAAA AATAACACAT TCCTTTCACA GGGAGTCTTC CTACTACGTT ATTATTTTCA 180 

CATAAGCGAG TAGGTGTTTA GCGTGGAGAA ATTTCTGGCC ATGCTTCGTC TAATACTTTT 240

TTCGTGGCAT TGTATACTTT AAGGGCGGCA TACTCTAAAC CAGTTGCTAC CATAGAGTCT 300

GAAATAACTT CGACTCCCAT CACACGTGGA TTTACACCAT GTTCTTTTAA AATTTTTGCA 360 

AAACCGACCG TATCTCCGTA GCCTTCTCCA GGAGCTAGAC GATCATGAAG TGATTCCTCA 420 

CGAAGTTCTT TGTAAGGCGT TTCGTGGACA TCGCATAGTT GGATGGAAAC AATCCGATCA 480 

GCGGGAACGT TTTTGATAGA TTCTGCTGTT TGGTTTGCTC TAGCCCAGTG CCACGTGTCA 540

CAAATAAGTT GCGCATTATC TCGGCCGCAT GCTTCTGCTA CTCGCCAAGC TGCTTGTAAG 600 

TCTGCTACAC CGCTATATGG CATAAATTCT AAACCAATAA TTAATTCTTC TGCGCGGTCA 660

CATAATTCAC CAAGCGCGAC GATGATTTGT TCCTCAGGGA TTTTTTCAAG CAAACCACAA 720

TTAATATGTT TGACGCCAAA TAATCGCGCC ATGTGGAAAG TTGTTTGCTC TTTCTTTTGT 780

TGTTCTGCTG TACGATCTTC GGCAGTTCCC CACTGAGTTA TGTACTCCAC TTCTGTTACT 840

TTCATGTTAT GCTCGTCTAA AATCCGCAAC ATGTCTTCAT CCGTTAATCC GGCAGCTAGT 900

GCATCTACAT AGTTTTCTGC ACGCAAGCCA ATTCCGTCAA AACCGTTTTC CGCAGCGACT 960 

TTTACTCGTT TAGGAAAAGA TACCTCTGTT CCAAGCGTAT AAGAGCTAAT CGTGATGGGG 1020

CATTTTTTTA GGTTGCCATT TGCATTTGTC ATAAAAATTA TCTCCTCTCC ATAATAAAAA 1080 

TTACAAGAAA CTTTGATAAT ATTTTCACAA ACACCAGTAA AAAAATTAAT TCCGCTTAAT 1140

TAAAAACCTC TGATGTGATA ACGCCTTCAA TAGTTGAAAA TGGAACTGGA CAGTTAACCT 1200

ATTCTACCGT ATATTGGTTT TTAAGGAATA GTTTATTTCA CTGGCGTAAC TACAGTCTAA 1260

TTGTATTATG ACTATTCCAT AAAAACAAAT TGGTATTGTT CTATTAATTG ATAGATAAAT 1320

TGCATAGATA ACTTTTTAGT TAGGAGAGAA GCAT 1354

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1354 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iv) ANTI-SENSE: YES

(vi) ORIGINAL SOURCE:

(B) STRAIN: L MONO 647 - D.F.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:

ATGCTTCTCT CCTAACTAAA AAGTTATCTA TGCAATTTAT CTATCAATTA ATAGAACAAT 60 

ACCAATTTGT TTTTATGGAA TAGTCATAAT ACAATTAGAC TGTAGTTACG CCAGTGAAAT 120 

AAACTATTCC TTAAAAACCA ATATACGGTA GAATAGGTTA ACTGTCCAGT TCCATTTTCA 180

ACTATTGAAG GCGTTATCAC ATCAGAGGTT TTTAATTAAG CGGAATTAAT TTTTTTACTG 240

GTGTTTGTGA AAATATTATC AAAGTTTCTT GTAATTTTTA TTATGGAGAG GAGATAATTT 300

TTATGACAAA TGCAAATGGC AACCTAAAAA AATGCCCCAT CACGATTAGC TCTTATACGC 360

TTGGAACAGA GGTATCTTTT CCTAAACGAG TAAAAGTCGC TGCGGAAAAC GGTTTTGACG 420 

GAATTGGCTT GCGTGCAGAA AACTATGTAG ATGCACTAGC TGCCGGATTA ACGGATGAAG 480 

ACATGTTGCG GATTTTAGAC GAGCATAACA TGAAAGTAAC AGAAGTGGAG TACATAACTC 540

AGTGGGGAAC TGCCGAAGAT CGTACAGCAG AACAACAAAA GAAAGAGCAA ACAACTTTCC 600 

ACATGGCGCG ATTATTTGGC GTCAAACATA TTAATTGTGG TTTGCTTGAA AAAATCCCTG 660 

AGGAACAAAT CATCGTCGCG CTTGGTGAAT TATGTGACCG CGCAGAAGAA TTAATTATTG 720

GTTTAGAATT TATGCCATAT AGCGGTGTAG CAGACTTACA AGCAGCTTGG CGAGTAGCAG 780

AAGCATGCGG CCGAGATAAT GCGCAACTTA TTTGTGACAC GTGGCACTGG GCTAGAGCAA 840

ACCAAACAGC AGAATCTATC AAAAACGTTC CCGCTGATCG GATTGTTTCC ATCCAACTAT 900 

GCGATGTCCA CGAAACGCCT TACAAAGAAC TTCGTGAGGA ATCACTTCAT GATCGTCTAG 960 

CTCCTGGAGA AGGCTACGGA GATACGGTCG GTTTTGCAAA AATTTTAAAA GAACATGGTG 1020

TAAATCCACG TGTGATGGGA GTCGAAGTTA TTTCAGACTC TATGGTAGCA ACTGGTTTAG 1080

AGTATGCCGC CCTTAAAGTA TACAATGCCA CGAAAAAAGT ATTAGACGAA GCATGGCCAG 1140

AAATTTCTCC ACGCTAAACA CCTACTCGCT TATGTGAAAA TAATAACGTA GTAGGAAGAC 1200

TCCCTGTGAA AGGAATGTGT TATTTTTGAA ATTTAATTCG ATGTTTTCAC CTATTGATAT 1260

CGGGCCGATG AAGGTGCCTA ACAGATTTGT CGTATCGCCA ATGTGTAACA ACTATGCAAA 1320
(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1327 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(B) STRAIN: L MONO 899 D.E.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:



ATCTGTATTT GCATAGTTGT TACACATTGG CGATACGACA AATCTGTTAG GCACCTTCAT 60

CGGCCCGATA TCAATAGGTG AAAACATCGA ATTAAATTTC AAAAATAACA CATTCCTTTC 120

ACAGGGAGTC TTCCTACTAC GTTATTATTT TCACATAAGC GAGTAGGTGT TTAGCGTGGA 180

GAAATTTCTG GCCATGCTTC GTCTAATACT TTTTTCGTGG CATTGTATAC TTTAAGGGCG 240

GCATACTCTA AACCAGTTGC TACCATAGAG TCTGAAATAA CTTCGACTCC CATCACACGT 300

GGATTTACAC CATGTTCTTT TAAAATTTTT GCAAAACCGA CCGTATCTCC GTAGCCTTCT 360

CCAGGAGCTA GACGATCATG AAGTGATTCC TCACGAAGTT CTTTGTAAGG CGTTTCGTGG 420

ACATCGCATA GTTGGATGGA AACAATCCGA TCAGCGGGAA CGTTTTTGAT AGATTCTGCT 480

GTTTGGTTTG CTCTAGCCCA GTGCCACGTG TCACAAATAA GTTGCGCATT ATCTCGGCCG 540

CATGCTTCTG CTACTCGCCA AGCTGCTTGT AAGTCTGCTA CACCGCTATA TGGCATAAAT 600

TCTAAACCAA TAATTAATTC TTCTGCGCGG TCACATAATT CACCAAGCGC GACGATGATT 660

TGTTCCTCAG GGATTTTTTC AAGCAAACCA CAATTAATAT GTTTGACGCC AAATAATCGC 720

GCCATGTGGA AAGTTGTTTG CTCTTTCTTT TGTTGTTCTG CTGTACGATC TTCGGCAGTT 780

CCCCACTGAG TTATGTACTC CACTTCTGTT ACTTTCATGT TATGCTCGTC TAAAATCCGC 840

AACATGTCTT CATCCGTTAA TCCGGCAGCT AGTGCATCTA CATAGTTTTC TGCACGCAAG 900

CCAATTCCGT CAAAACCGTT TTCCGCAGCG ACTTTTACTC GTTTAGGAAA AGATACCTCT 960

GTTCCAAGCG TATAAGAGCT AATCGTGATG GGGCATTTTT TTAGGTTGCC ATTTGCATTT 1020

GTCATAAAAA TTATCTCCTC TCCATAATAA AAATTACAAG AAACTTTGAT AATATTTTCA 1080

CAAACACCAG TAAAAAAATT AATTCCGCTT AATTAAAAAC CTCTGATGTG ATAACGCCTT 1140

CAATAGTTGA AAATGGAACT GGACAGTTAA CCTATTCTAC CGTATATTGG TTTTTAAGGA 1200

ATAGTTTATT TCACTGGCGT AACTACAGTC TAATTGTATT ATGACTATTC CATAAAAACA 1260

AATTGGTATT GTTCTATTAA TTGATAGATA AATTGCATAG ATAACTTTTT AGTTAGGAGA 1320

GAAGCAT 1327
(2) INFORMATION FOR SEQ ID NO: 27:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1327 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iv) ANTI-SENSE: YES

(vi) ORIGINAL SOURCE:

(B) STRAIN: L MONO 899 D.E.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

ATGCTTCTCT CCTAACTAAA AAGTTATCTA TGCAATTTAT CTATCAATTA ATAGAACAAT 60 

ACCAATTTGT TTTTATGGAA TAGTCATAAT ACAATTAGAC TGTAGTTACG CCAGTGAAAT 120

AAACTATTCC TTAAAAACCA ATATACGGTA GAATAGGTTA ACTGTCCAGT TCCATTTTCA 180 

ACTATTGAAG GCGTTATCAC ATCAGAGGTT TTTAATTAAG CGGAATTAAT TTTTTTACTG 240 

GTGTTTGTGA AAATATTATC AAAGTTTCTT GTAATTTTTA TTATGGAGAG GAGATAATTT 300

TTATGACAAA TGCAAATGGC AACCTAAAAA AATGCCCCAT CACGATTAGC TCTTATACGC 360

TTGGAACAGA GGTATCTTTT CCTAAACGAG TAAAAGTCGC TGCGGAAAAC GGTTTTGACG 420

GAATTGGCTT GCGTGCAGAA AACTATGTAG ATGCACTAGC TGCCGGATTA ACGGATGAAG 480

ACATGTTGCG GATTTTAGAC GAGCATAACA TGAAAGTAAC AGAAGTGGAG TACATAACTC 540

AGTGGGGAAC TGCCGAAGAT CGTAGAGCAG AACAACAAAA GAAAGAGCAA ACAACTTTCC 600 

ACATGGCGCG ATTATTTGGC GTCAAACATA TTAATTGTGG TTTGCTTGAA AAAATCCCTG 660 

AGGAACAAAT CATCGTCGCG CTTGGTGAAT TATGTGACCG CGCAGAAGAA TTAATTATTG 720 

GTTTAGAATT TATGCCATAT AGCGGTGTAG CAGACTTACA AGCAGCTTGG CGAGTAGCAG 780

AAGCATGCGG CCGAGATAAT GCGCAACTTA TTTGTGACAC GTGGCACTGG GCTAGAGCAA 840

ACCAAACAGC AGAATCTATC AAAAACGTTC CCGCTGATCG GATTGTTTCC ATCCAACTAT 900

GCGATGTCCA CGAAACGCCT TACAAAGAAC TTCGTGAGGA ATCACTTCAT GATCGTCTAG 960

CTCCTGGAGA AGGCTACGGA GATACGGTCG GTTTTGCAAA AATTTTAAAA GAACATGGTG 1020

TAAATCCACG TGTGATGGGA GTCGAAGTTA TTTCAGACTC TATGGTAGCA ACTGGTTTAG 1080

AGTATGCCGC CCTTAAAGTA TACAATGCCA CGAAAAAAGT ATTAGACGAA GCATGGCCAG 1140

AAATTTCTCC ACGCTAAACA CCTACTCGCT TATGTGAAAA TAATAACGTA GTAGGAAGAC 1200

TCCCTGTGAA AGGAATGTGT TATTTTTGAA ATTTAATTCG ATGTTTTCAC CTATTGATAT 1260

CGGGCCGATG AAGGTGCCTA ACAGATTTGT CGTATCGCCA ATGTGTAACA ACTATGCAAA 1320

TACAGAT 1327
(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1274 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(B) STRAIN: L MONO 3386 D.F.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:



AGGCACCTTC ATCGGCCCGA TATCAATAGG TGAAAACATC GAATTAAATT TCAAAAATAA 60

CACATTCCTT TCACAGGGAG TCTTCCTACT ACGTTATTAT TTTCACATAA GCGAGTAGGT 120

GTTTAGCGTG GAGAAATTTC TGGCCATGCT TCGTCTAATA CTTTTTTCGT GGCATTGTAT 180

ACTTTAAGGG CGGCATACTC TAAACCAGTT GCTACCATAG AGTCTGAAAT AACTTCGACT 240

CCCATCACAC GTGGATTTAC ACCATGTTCT TTTAAAATTT TTGCAAAACC GACCGTATCT 300

CCGTAGCCTT CTCCAGGAGC TAGACGATCA TGAAGTGATT CCTCACGAAG TTCTTTGTAA 360

GGCGTTTCGT GGACATCGCA TAGTTGGATG GAAACAATCC GATCAGCGGG AACGTTTTTG 420

ATAGATTCTG CTGTTTGGTT TGCTCTAGCC CAGTGCCACG TGTCACAAAT AAGTTGCGCA 480

TTATCTCGGC CGCATGCTTC TGCTACTCGC CAAGCTGCTT GTAAGTCTGC TACACCGCTA 540

TATGGCATAA ATTCTAAACC AATAATTAAT TCTTCTGCGC GGTCACATAA TTCACCAAGC 600

GCGACGATGA TTTGTTCCTC AGGGATTTTT TCAAGCAAAC CACAATTAAT ATGTTTGACG 660

CCAAATAATC GCGCCATGTG GAAAGTTGTT TGCTCTTTCT TTTGTTGTTC TGCTGTACGA 720

TCTTCGGCAG TTCCCCACTG AGTTATGTAC TCCACTTCTG TTACTTTCAT GTTATGCTCG 780

TCTAAAATCC GCAACATGTC TTCATCCGTT AATCCGGCAG CTAGTGCATC TACATAGTTT 840

TCTGCACGCA AGCCAATTCC GTCAAAACCG TTTTCCGCAG CGACTTTTAC TCGTTTAGGA 900

AAAGATACCT CTGTTCCAAG CGTATAAGAG CTAATCGTGA TGGGGCATTT TTTTAGGTTG 960

CCATTTGCAT TTGTCATAAA AATTATCTCC TCTCCATAAT AAAAATTACA AGAAACTTTG 1020

ATAATATTTT CACAAACACC AGTAAAAAAA TTAATTCCGC TTAATTAAAA ACCTCTGATG 1080

TGATAACGCC TTCAATAGTT GAAAATGGAA CTGGACAGTT AACCTATTCT ACCGTATATT 1140

GGTTTTTAAG GAATAGTTTA TTTCACTGGC GTAACTACAG TCTAATTGTA TTATGACTAT 1200

TCCATAAAAA CAAATTGGTA TTGTTCTATT AATTGATAGA TAAATTGCAT AGATAACTTT 1260

TTAGTTAGGA GAGA 1274

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1274 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iv) ANTI-SENSE: YES

(vi) ORIGINAL SOURCE:

(B) STRAIN: L MONO 3386 D.F.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 



TCTCTCCTAA CTAAAAAGTT ATCTATGCAA TTTATCTATC AATTAATAGA ACAATACCAA 60

TTTGTTTTTA TGGAATAGTC ATAATACAAT TAGACTGTAG TTACGCCAGT GAAATAAACT 120

ATTCCTTAAA AACCAATATA CGGTAGAATA GGTTAACTGT CCAGTTCCAT TTTCAACTAT 180

TGAAGGCGTT ATCACATCAG AGGTTTTTAA TTAAGCGGAA TTAATTTTTT TACTGGTGTT 240

TGTGAAAATA TTATCAAAGT TTCTTGTAAT TTTTATTATG GAGAGGAGAT AATTTTTATG 300

ACAAATGCAA ATGGCAACCT AAAAAAATGC CCCATCACGA TTAGCTCTTA TACGCTTGGA 360

ACAGAGGTAT CTTTTCCTAA ACGAGTAAAA GTCGCTGCGG AAAACGGTTT TGACGGAATT 420

GGCTTGCGTG CAGAAAACTA TGTAGATGCA CTAGCTGCCG GATTAACGGA TGAAGACATG 480

TTGCGGATTT TAGACGAGCA TAACATGAAA GTAACAGAAG TGGAGTACAT AACTCAGTGG 540

GGAACTGCCG AAGATCGTAC AGCAGAACAA CAAAAGAAAG AGCAAACAAC TTTCCACATG 600

GCGCGATTAT TTGGCGTCAA ACATATTAAT TGTGGTTTGC TTGAAAAAAT CCCTGAGGAA 660

CAAATCATCG TCGCGCTTGG TGAATTATGT GACCGCGCAG AAGAATTAAT TATTGGTTTA 720

GAATTTATGC CATATAGCGG TGTAGCAGAC TTACAAGCAG CTTGGCGAGT AGCAGAAGCA 780

TGCGGCCGAG ATAATGCGCA ACTTATTTGT GACACGTGGC ACTGGGCTAG AGCAAACCAA 840

ACAGCAGAAT CTATCAAAAA CGTTCCCGCT GATCGGATTG TTTCCATCCA ACTATGCGAT 900

GTCCACGAAA CGCCTTACAA AGAACTTCGT GAGGAATCAC TTCATGATCG TCTAGCTCCT 960

GGAGAAGGCT ACGGAGATAC GGTCGGTTTT GCAAAAATTT TAAAAGAACA TGGTGTAAAT 1020

CCACGTGTGA TGGGAGTCGA AGTTATTTCA GACTCTATGG TAGCAACTGG TTTAGAGTAT 1080

GCCGCCCTTA AAGTATACAA TGCCACGAAA AAAGTATTAG ACGAAGCATG GCCAGAAATT 1140

TCTCCACGCT AAACACCTAC TCGCTTATGT GAAAATAATA ACGTAGTAGG AAGACTCCCT 1200

GTGAAAGGAA TGTGTTATTT TTGAAATTTA ATTCGATGTT TTCACCTATT GATATCGGGC 1260

CGATGAAGGT GCCT 1274

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1289 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(B) STRAIN: L MONO 1324 D.F.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:



CAAATCTGTT AGGCACCTTC ATCGGCCCGA TATCAATAGG TGAAAACATC GAATTAAATT 60

TCAAAAATAA CACATTCCTT TCACAGGGAG TCTTCCTACT ACGTTATCAT TTTCACATAA 120

CTAAGTAGGT GTTTAGCGTG GAGAAATCTC TGGCCATGCT TCGTCTAATA CTTTTTTCGT 180

GGCATTGTAT ACTTTAAGAG CGGCATACTC TAAACCAGTT GCTACCATAG AGTCTGAAAT 240

AACTTCAACT CCCATCACAC GTGGATTTAC ACCATGCTCT TTTAAAATTT TTGCAAAACC 300

GACCGTATCT CCGTATCCTT CTCCAGGAGC TAAACGATCA TGAAGTGATT CTTCACGAAG 360

TTCTTTGTAA GGTGTTTCGT GGACATCGCA TAGTTGGATA GAAACAATCC GATCAGCGGG 420

AACATTTTTG ATAGATTCAG CTGTTTGATT TGCTCTAGCC CAGTGCCATG TGTCACAAAT 480

AAGTTGCGCG TTATCTCGTC CGCATGCTTC TGCTACTCGC CAAGCTGCTT GTAAGTCTGC 540

TACACCGCTA TATGGCATAA ATTCTAAACC AATAATTAAT TCTTCTGCGC GGTCACATAA 600

TTCACCAAGC GCGACAATGA TTTGTTCCTC AGGGATTTTT TCAAGCAAAC CACAATTAAT 660

ATGTTTGACG CCAAATAATC GCGCCATGTG GAAAGTTGTT TGCTCTTTCT TTTGTTGTTC 720

TGCTGTACGA TCTTCGGCAG TTCCCCACTG GGTTATGTAC TCCACTTCTG TTACTTTCAT 780

GTTATGCTCG TCTAAAATCC GCAACATGTC TTCATCGGTT AATCCGGCAG CTAGTGCATC 840

TACATAATTT TCTGCACGCA AGCCAATTCC GTCAAAACCA TTTTCCGCAG CGACTTTCAC 900

TCGTTTAGGA AAAGATACCT CCGTTCCTAG TGTGTAAGAG CTAATCGTGA TGGGGCATTT 960

TTTTAGATTG CCATTTGCAT TTGTCATAAA AATTATCTCC TCTCCATAAT AAAAATTACA 1020

AGAAACTTTG ATAATATTTT CACAAACACC AGTAAAAAAA TAAATTCCAC TAAATTAAAA 1080

ATCTCTGATG TGATAACGCC TTCAATAGTT AAAAATGGAA CTGGACAGTT AACCTATTCT 1140

ACCGTATATT GGTTTTTAAG GAATAGTTTA TTTCACTGGC GTAACTACAG TTTAATTGTA 1200

TTATGACTAT TCCATAAAAA CAAATTGGTA TTGTTCTATT AATTGATAGA TAAATTGCAT 1260

AGATAACTTT TTAGTTAGGA GAGAAGCAT 1289



(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1289 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iv) ANTI-SENSE: YES
(vi) ORIGINAL SOURCE:

(B) STRAIN: L MONO 1324 D.F.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

ATGCTTCTCT CCTAACTAAA AAGTTATCTA TGCAATTTAT CTATCAATTA ATAGAACAAT 60

ACCAATTTGT TTTTATGGAA TAGTCATAAT ACAATTAAAC TGTAGTTACG CCAGTGAAAT 120

AAACTATTCC TTAAAAACCA ATATACGGTA GAATAGGTTA ACTGTCCAGT TCCATTTTTA 180

ACTATTGAAG GCGTTATCAC ATCAGAGATT TTTAATTTAG TGGAATTTAT TTTTTTACTG 240

GTGTTTGTGA AAATATTATC AAAGTTTCTT GTAATTTTTA TTATGGAGAG GAGATAATTT 300

TTATGACAAA TGCAAATGGC AATCTAAAAA AATGCCCCAT CACGATTAGC TCTTACACAC 360

TAGGAACGGA GGTATCTTTT CCTAAACGAG TGAAAGTCGC TGCGGAAAAT GGTTTTGACG 420

GAATTGGCTT GCGTGCAGAA AATTATGTAG ATGCACTAGC TGCCGGATTA ACCGATGAAG 480

ACATGTTGCG GATTTTAGAC GAGCATAACA TGAAAGTAAC AGAAGTGGAG TACATAACCC 540

AGTGGGGAAC TGCCGAAGAT CGTACAGCAG AACAACAAAA GAAAGAGCAA ACAACTTTCC 600

ACATGGCGCG ATTATTTGGC GTCAAACATA TTAATTGTGG TTTGCTTGAA AAAATCCCTG 660

AGGAACAAAT CATTGTCGCG CTTGGTGAAT TATGTGACCG CGCAGAAGAA TTAATTATTG 720

GTTTAGAATT TATGCCATAT AGCGGTGTAG CAGACTTACA AGCAGCTTGG CGAGTAGCAG 780

AAGCATGCGG ACGAGATAAC GCGCAACTTA TTTGTGACAC ATGGCACTGG GCTAGAGCAA 840

ATCAAACAGC TGAATCTATC AAAAATGTTC CCGCTGATCG GATTGTTTCT ATCCAACTAT 900

GCGATGTCCA CGAAACACCT TACAAAGAAC TTCGTGAAGA ATCACTTCAT GATCGTTTAG 960

CTCCTGGAGA AGGATACGGA GATACGGTCG GTTTTGCAAA AATTTTAAAA GAGCATGGTG 1020

TAAATCCACG TGTGATGGGA GTTGAAGTTA TTTCAGACTC TATGGTAGCA ACTGGTTTAG 1080

AGTATGCCGC TCTTAAAGTA TACAATGCCA CGAAAAAAGT ATTAGACGAA GCATGGCCAG 1140

AGATTTCTCC ACGCTAAACA CCTACTTAGT TATGTGAAAA TGATAACGTA GTAGGAAGAC 1200

TCCCTGTGAA AGGAATGTGT TATTTTTGAA ATTTAATTCG ATGTTTTCAC CTATTGATAT 1260

CGGGCCGATG AAGGTGCCTA ACAGATTTG 1289
(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 284 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(vi) ORIGINAL SOURCE:

(B) STRAIN: L MONO ORF






WO 98/20160 



PCT/US97/19896 * 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

Met Thr Asn Ala Asn Gly Asn Leu Lys Lys Cys Pro Ile Thr Ile Ser
1 5 10 15

Ser Tyr Thr Leu Gly Thr Glu Val Ser Phe Pro Lys Arg Val Lys Val
20 25 30

Ala Ala Glu Asn Gly Phe Asp Gly Ile Gly Leu Arg Ala Glu Asn Tyr
35 40 45

Val Asp Ala Leu Ala Ala Gly Leu Thr Asp Glu Asp Met Leu Arg Ile
50 55 60

Leu Asp Glu His Asn Met Lys Val Thr Glu Val Glu Tyr Ile Thr Gln
65 70 75 80

Trp Gly Thr Ala Glu Asp Arg Thr Ala Glu Gln Gln Lys Lys Glu Gln
85 90 95

Thr Thr Phe His Met Ala Arg Leu Phe Gly Val Lys His Ile Asn Cys
100 105 110

Gly Leu Leu Glu Lys Ile Pro Glu Glu Gln Ile Ile Val Ala Leu Gly
115 120 125

Glu Leu Cys Asp Arg Ala Glu Glu Leu Ile Ile Gly Leu Glu Phe Met
130 135 140

Pro Tyr Ser Gly Val Ala Asp Leu Gln Ala Ala Trp Arg Val Ala Glu
145 150 155 160

Ala Cys Gly Arg Asp Asn Ala Gln Leu Ile Cys Asp Thr Trp His Trp
165 170 175

Ala Arg Ala Asn Gln Thr Ala Glu Ser Ile Lys Asn Val Pro Ala Asp
180 185 190

Arg Ile Val Ser Ile Gln Leu Cys Asp Val His Glu Thr Pro Tyr Lys
195 200 205

Glu Leu Arg Glu Glu Ser Leu His Asp Arg Leu Ala Pro Gly Glu Gly
210 215 220

Tyr Gly Asp Thr Val Gly Phe Ala Lys Ile Leu Lys Glu His Gly Val
225 230 235 240

Asn Pro Arg Val Met Gly Val Glu Val Ile Ser Asp Ser Met Val Ala
245 250 255

Thr Gly Leu Glu Tyr Ala Ala Leu Lys Val Tyr Asn Ala Thr Lys Lys
260 265 270

Val Leu Asp Glu Ala Trp Pro Glu Ile Ser Pro Arg
275 280
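
The protein of SEQ ID NO: 32 appears to correspond to an open reading frame in the anti-sense listings, for example the ATG that begins at about position 303 of SEQ ID NO: 31. The following is a minimal Python sketch, not part of the original listing, of how such a fragment could be translated for comparison. The names `CODON_TABLE` and `translate` are illustrative, the standard genetic code is assumed, and the output uses one-letter amino acid codes rather than the three-letter codes printed above.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (assumption: the standard table applies to this reading frame).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        residue = CODON_TABLE[cleaned[i:i + 3]]
        if residue == "*":  # stop codon ends the reading frame
            break
        protein.append(residue)
    return "".join(protein)


# 48 bases starting at the ATG in SEQ ID NO: 31 (16 codons).
fragment = "ATGACAAATGCAAATGGCAATCTAAAAAAATGCCCCATCACGATTAGC"
print(translate(fragment))
```

The one-letter result can be compared with the first 16 residues of SEQ ID NO: 32 (Met Thr Asn Ala Asn Gly Asn Leu Lys Lys Cys Pro Ile Thr Ile Ser).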

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1329 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(B) STRAIN: L INNOCUA 4450 D.F.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CATCTGTATT CGCATAGTTA TTACACATTG GAGATACAAC AAAACGATTT GGTACTCTCA 60

TAGGCCCGAT ATCAATCGGT GAGAACATCG AATTAAATTT CAAAAAAAAC ACTTCCCTTT 120

CAAGGTAGAA TCTTCTTGTT ACGTTACTTT TTTCACATGT TTAATTATTT TTTTATTTGG 180

GAGAAATTTC TGGCCATGCT TCGTCCAATA CTTTTTTCGT TGCATTATAT ACTTTGATGG 240

CAGCATATTC TAAACCAGTT TCTACCATGG AGTCGGATAT AACTTCAACG CCCATAACAC 300

GGGGGTTAAC GCCATGCTCT TTCAAGATTC GAGCAAAACC GACTGTATCG CCGTATCCCT 360

CGCCAGGAGC AAGTCGGTCA TGGAGAGATT CTTCCCGAAG TTCTTTGTAA GGTGTTTCGT 420

GTACGTCACA TAATTGGATG GAAACAATCC GGTCAGCGGG AACATTTTTG ATTGATTCTG 480

CTGTTTGGTT TGCTCTAGCC GAGTGCCAAG TGTCGCAAAT TAGTTGTGCA TTATCCCTGC 540

CACATGCTTC AGCTACACGC CAAGCTGCTG CTAAGTCTGC TACACCACTG TATGGCATAA 600

ACTCTAAACC GATAATTAAT TCTTCCGCAC GGTCACACAA TTCACCGAGT GCCGTAATGA 660

TTTGTTCTTC GGGGATTTTT TCAAGCAAAC CGCAGTTAAT ATGTTTGACG CCGAATAAAC 720

GCGCCATGTG GAAAGTAGTT TGTTCTTTCT TTTGCTGTTC GGCTGTGCGG TCCTCGGCGG 780

TTCCCCACTG AGTTATGTAT TCTACTTCTG TTACTTTGAT GTTATGCTCA TCTAAAATCC 840

GCAACATATC TTCATCAGTT AATCCAGCGG CTAGTGCGTC TACATAGTTT TCTGCACGTA 900

AGCCAATTCC ATCAAAACCA TTTTCTGCTG CGATTCTCAC TCGTTCAGGA AAAGATACCT 960

CCGTTCCAAG CGTGTAAGAG CTGATCGTGA TTGGGCATTT TTTTAGGTCG CCATTTGCAT 1020

TTGTCATAAA AATTATCTCC TCTCTAGAAT AAAAATTACA AGAAACTTTG ATAATATTTT 1080

CACAAACACC AGTAAAAAAA TAAATTCCCG TTCATTAAAT ATCGCTGATG TGATAACGCC 1140

TTCAATGTTT GAAATTTCAA CTGGACAGTT AACGTATTCT ACCGTATATT GGTTTTTAAG 1200

GAATAGTTTG TTCTGCTGGT GTAACTACAG TCTAATTGTA TTATGACTAT TCCATAAAAA 1260

CAAATTGGTA TTATTCTATT AATTGATAGA TAAATTGCAT AGATAATTTT TAGTAAGGAG 1320

AGAAGCCAT 1329
(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES
(vi) ORIGINAL SOURCE:

(B) STRAIN: L INNOCUA 4450 D.F.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:



ATGGCTTCTC TCCTTACTAA AAATTATCTA TGCAATTTAT CTATCAATTA ATAGAATAAT 60

ACCAATTTGT TTTTATGGAA TAGTCATAAT ACAATTAGAC TGTAGTTACA CCAGCAGAAC 120

AAACTATTCC TTAAAAACCA ATATACGGTA GAATACGTTA ACTGTCCAGT TGAAATTTCA 180

AACATTGAAG GCGTTATCAC ATCAGCGATA TTTAATGAAC GGGAATTTAT TTTTTTACTG 240

GTGTTTGTGA AAATATTATC AAAGTTTCTT GTAATTTTTA TTCTAGAGAG GAGATAATTT 300

TTATGACAAA TGCAAATGGC GACCTAAAAA AATGCCCAAT CACGATCAGC TCTTACACGC 360

TTGGAACGGA GGTATCTTTT CCTGAACGAG TGAGAATCGC AGCAGAAAAT GGTTTTGATG 420

GAATTGGCTT ACGTGCAGAA AACTATGTAG ACGCACTAGC CGCTGGATTA ACTGATGAAG 480

ATATGTTGCG GATTTTAGAT GAGCATAACA TCAAAGTAAC AGAAGTAGAA TACATAACTC 540

AGTGGGGAAC CGCCGAGGAC CGCACAGCCG AACAGCAAAA GAAAGAACAA ACTACTTTCC 600

ACATGGCGCG TTTATTCGGC GTCAAACATA TTAACTGCGG TTTGCTTGAA AAAATCCCCG 660

AAGAACAAAT CATTACGGCA CTCGGTGAAT TGTGTGACCG TGCGGAAGAA TTAATTATCG 720

GTTTAGAGTT TATGCGATAC AGTGGTGTAG CAGACTTAGC AGCAGCTTGG CGTGTAGCTG 780

AAGCATGTGG CAGGGATAAT GCACAACTAA TTTGCGACAC TTGGCACTCG GCTAGAGCAA 840

ACCAAACAGC AGAATCAATC AAAAATGTTC CCGCTGACCG GATTGTTTCC ATCCAATTAT 900

GTGACGTACA CGAAACACCT TACAAAGAAC TTCGGGAAGA ATCTCTCCAT GACCGACTTG 960

CTCCTGGCGA GGGATACGGC GATACAGTCG GTTTTGCTCG AATCTTGAAA GAGCATGGCG 1020

TTAACCCCCG TGTTATGGGC GTTGAAGTTA TATCCGACTC CATGGTAGAA ACTGGTTTAG 1080

AATATGCTGC CATCAAAGTA TATAATGCAA CGAAAAAAGT ATTGGACGAA GCATGGCCAG 1140

AAATTTCTCC CAAATAAAAA AATAATTAAA CATGTGAAAA AAGTAACGTA ACAAGAAGAT 1200

TCTACCTTGA AAGGGAAGTG TTTTTTTTGA AATTTAATTC GATGTTCTCA CCGATTGATA 1260

TCGGGCCTAT GAGAGTACCA AATCGTTTTG TTGTATCTCC AATGTGTAAT AACTATGCGA 1320

ATACAGATG 1329



(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1319 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(B) STRAIN: L SEELIGERI 3327 D.F.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

TATTTGCATA GTTATTACAC ATCGGCGAAA CCACGAAACG GTTTGGCACA CGCATAGGTC 60

CGATATCTAT TGGAGAAAAC ATCGAATTAA ATTTCAAAAA AAACACTTTC CTTTCAAAGG 120

GGGATTTTTA ACGTTATTAA TTTCACATAG CGATGCGTCA AGTTACTTTG GAGAAACTTC 180

TGGCCATGCT TCATCTAATA CTTTTTTAGT AGCATTATAA ACTTTAATTG CCGCATATTC 240

TAAACCTGTT TCTACCATCG AGTCTGAAAT CACTTCCACA CCCATAACAC GCGGATTTAC 300

ACCATGTTCT TTTAAAATAC GAGCAAAGCC TACAGTGTCG CCGTATCCTT CACCAGGTGC 360

TAATCGATCA TGGAGAGATT CTTCACGTAA TTCTTTGTAA GGAGTTTCGT GAACATCGCA 420

AAGTTGAATA GACACAATTC GGTCAGCAGG AATATTTTTG ATAGATTCTG CTGTTTGATT 480

TGCTCTAGCC CAGTGCCAAG TATCACAAAT CAGTTGCGCG TTATCTCGGC CACATGCTTC 540

TGCCACACGC CATGCTGCTG CTAAATCTGC TACACCGCTA TAAGGCATGA ATTCTAAACC 600

GATAATTAAT TCTTCAGCAC GGTCACAAAG TTCACCCAGA GCAGTAATGA TTTGTTCTTC 660

CGGAATCTTT TCAAGTAAAC CACAGTTAAT ATGTTTTACG CCGAATAAGC GCGCCATATG 720

AAAAGTGGTT TGTTCTTTCT TTTGTTGTTC TTTGGTGCGG TCGGAAGCGG TTCCCCATTG 780

CGTTATGTAT TCTACTTCTG TTACTTTGAT GTGATGTTCG TCCAAAATAC GCAACATATC 840

TTCATCGGTT AAGCCTGCTG CAAGTGCATC AACATAGTTT TCTGCACGTA AACCAATTCC 900

ATCAAAACCA TTTTCTGCTG CGATTCGTAC TCGTTCAGGA AAAGAAACCT CCGTTCCAAG 960

CGTGTAAGAA CTAATCGTGA TGGGGCATTT TTTTAAGTCG CCATTTACAT TTGTCATAAA 1020

AATTATCTCC TCTCTAGATT AAAATACAAG AAACTTTGAT AATAATTTCA CAATCACCAG 1080

CAAAAAAATA AATTCCTTTT TAGAATAAAA CGTCCTGAAG TGATAACGCA TTCAATCATT 1140

GAAAATCTGA CTGGACAGTT TTCGAATTCT ACCGTATATT GGTTTTTAAA GGATAGTTTG 1200

TCTCACTGGC TTAATTACAG TTTAATTGTA GTATGACTAT TCCATAAAAA CAAATTGGTA 1260

TTATTCTATT AATTGATAGA TAAATTGCAT AGATGCTTTT TAAAGAGGGG AGAAACCAT 1319
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1319 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iv) ANTI-SENSE: YES

(vi) ORIGINAL SOURCE:

(B) STRAIN: L SEELIGERI 3327 D.F.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:



ATGGTTTCTC CCCTCTTTAA AAAGCATCTA TGCAATTTAT CTATCAATTA ATAGAATAAT 60

ACCAATTTGT TTTTATGGAA TAGTCATACT ACAATTAAAC TGTAATTAAG CCAGTGAGAC 120

AAACTATCCT TTAAAAACCA ATATACGGTA GAATTCGAAA ACTGTCCAGT CAGATTTTCA 180

ATGATTGAAT GCGTTATCAC TTCAGGACGT TTTATTCTAA AAAGGAATTT ATTTTTTTGC 240

TGGTGATTGT GAAATTATTA TCAAAGTTTC TTGTATTTTA ATCTAGAGAG GAGATAATTT 300

TTATGACAAA TGTAAATGGC GACTTAAAAA AATGCCCCAT CACGATTAGT TCTTACACGC 360

TTGGAACGGA GGTTTCTTTT CCTGAACGAG TACGAATCGC AGCAGAAAAT GGTTTTGATG 420

GAATTGGTTT ACGTGCAGAA AACTATGTTG ATGCACTTGC AGCAGGCTTA ACCGATGAAG 480

ATATGTTGCG TATTTTGGAC GAACATCACA TCAAAGTAAC AGAAGTAGAA TACATAACGC 540

AATGGGGAAC CGCTTCCGAC CGCACCAAAG AACAACAAAA GAAAGAACAA ACCACTTTTC 600

ATATGGCGCG CTTATTCGGC GTAAAACATA TTAACTGTGG TTTACTTGAA AAGATTCCGG 660

AAGAACAAAT CATTACTGCT CTGGGTGAAC TTTGTGACCG TGCTGAAGAA TTAATTATCG 720

GTTTAGAATT CATGCCTTAT AGCGGTGTAG CAGATTTAGC AGCAGCATGG CGTGTGGCAG 780

AAGCATGTGG CCGAGATAAC GCGCAACTGA TTTGTGATAC TTGGCACTGG GCTAGAGCAA 840

ATCAAACAGC AGAATCTATC AAAAATATTC CTGCTGACCG AATTGTGTCT ATTCAACTTT 900

GCGATGTTCA CGAAACTCCT TACAAAGAAT TACGTGAAGA ATCTCTCCAT GATCGATTAG 960

CACCTGGTGA AGGATACGGC GACACTGTAG GCTTTGCTCG TATTTTAAAA GAACATGGTG 1020

TAAATCCGCG TGTTATGGGT GTGGAAGTGA TTTCAGACTC GATGGTAGAA ACAGGTTTAG 1080

AATATGCGGC AATTAAAGTT TATAATGCTA CTAAAAAAGT ATTAGATGAA GCATGGCCAG 1140

AAGTTTCTCC AAAGTAACTT GACGCATCGC TATGTGAAAT TAATAACGTT AAAAATCCCC 1200

CTTTGAAAGG AAAGTGTTTT TTTTGAAATT TAATTCGATG TTTTCTCCAA TAGATATCGG 1260

ACCTATGCGT GTGCCAAACC GTTTCGTGGT TTCGCCGATG TGTAATAACT ATGCAAATA 1319



(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1304 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(B) STRAIN: L WELSHIMERI 3359 D.F.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

ATTACACATT GGAGATACAA CAAACCGATT AGGTACTTTC ATCGGGCCAA TATCGATAGG 60

TGAGAACATC GAATTAAATT TCAAAAAAAA CACTTCCCTT TCAAGATGGA ATCTAGATTA 120

CGTTATTATT TTCACATGTT TGAATACATT ATTTTGGAGA AATTTCTGGC CAAGCCTCGT 180

CTAATACTTT TTTCGTGGCA TTATATACTT TAATTGCAGC ATATTCTAGA CCAGTTTCAA 240

CCATAGAGTC AGATATTACT TCGACTCCCA TCACTCGTGG ATTTACACCA TGTTCTTTTA 300

GGATGCGTGC AAAACCAACT GTATCTCCAT ATCCTTCACC AGGTGGTAGT CGGTCATGAA 360

GCGATTCTTC ACGAAGTTCT TTGTATGGCG TTTCATGAAC ATCACACAAT TGGATAGAAA 420

CAATTCGATC AGCTGGGACA TTTTTGATAG ACTCTGCTGT TTGGTTTGCT CTTGCCCAGT 480

GCCAAGTATC GCAAATTAGT TGTGCGTTAT CTCTACCACA TGCTTCTGCA ACACGCCAAG 540

CTGCGGCTAA GTCTGCTACT CCGCTATACG GCATAAATTC TAAACCGATG ATTAATTCTT 600

CGGCACGATC ACATAACTCA CCAAGAGCAG TAATTATTTG CTCTTCTGGA ATTTTTTCAA 660

GTAAACCGCA GTTAATATGT TTTACACCGA ATAACCGAGC CATGTGGAAA GTGGTTTGCT 720

CTTTTTGTTG TTGAGCATCG GTTCGGTCTG CTTCAGTTCC CCACTGAGTT ATGTATTCTA 780

CTTCTGTTAC TTTGATATTA TGCTTGTCTA AAATCTGCAG CATGTCATCA TCAGTTAAGC 840

CAGCTGCAAG AGCGTCTACA TAATTTTCAG CTCGCAAGCC AATTCCGTCA AAACCATTTT 900

CTGCTGCAAT CTTTACACGT TCTGGGAAGG AAACCTCCGT TCCAAGTGTG TAAGAACTAA 960

TCGTGATGGG GCATTTTTTT AAGTTGCCAT TTGAATTTGT CATAAAAATT ATCTCCTCTC 1020

AAGAATGTAA ATTACAAGAA ACTTTGATAA TATTTTCACA AACACCAGTA AAAAAATAAA 1080

TTCCTTTTAA TTAAAAATCG CTGATGTGAT AACGCCTTCA ATGATCAAAA TACAACTGGA 1140

CAGTTAACGT ATTCTACCGT ATATTGGTTT TTAAGGAATA GTTTATTCTG CTGGTGTAAC 1200

TACAGTTTAA TTGTATTATG ACTATTCCAT AAAAACAAAT TGGTATTATT CTATTAATTG 1260

ATAGATAAAT TGCATAGATA CTTTTTAATA AGGGGAGAAG CCAT 1304
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(iv) ANTI-SENSE: YES

(vi) ORIGINAL SOURCE:

(B) STRAIN: L WELSHIMERI 3359 D.F.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

ATGGCTTCTC CCCTTATTAA AAAGTATCTA TGCAATTTAT CTATCAATTA ATAGAATAAT 60

ACCAATTTGT TTTTATGGAA TAGTCATAAT ACAATTAAAC TGTAGTTACA CCAGCAGAAT 120

AAACTATTCC TTAAAAACCA ATATACGGTA GAATACGTTA ACTGTCCAGT TGTATTTTGA 180

TCATTGAAGG CGTTATCACA TCAGCGATTT TTAATTAAAA GGAATTTATT TTTTTACTGG 240

TGTTTGTGAA AATATTATCA AAGTTTCTTG TAATTTACAT TCTTGAGAGG AGATAATTTT 300

TATGACAAAT TCAAATGGCA ACTTAAAAAA ATGCCCCATC ACGATTAGTT CTTACACACT 360

TGGAACGGAG GTTTCCTTCC CAGAACGTGT AAAGATTGCA GCAGAAAATG GTTTTGACGG 420

AATTGGCTTG CGAGCTGAAA ATTATGTAGA CGCTCTTGCA GCTGGCTTAA CTGATGATGA 480

CATGCTGCAG ATTTTAGACA AGCATAATAT CAAAGTAACA GAAGTAGAAT ACATAACTCA 540

GTGGGGAACT GAAGCAGACC GAACCGATGC TCAACAACAA AAAGAGCAAA CCACTTTCCA 600

CATGGCTCGG TTATTCGGTG TAAAACATAT TAACTGCGGT TTACTTGAAA AAATTCCAGA 660

AGAGCAAATA ATTACTGCTC TTGGTGAGTT ATGTGATCGT GCCGAAGAAT TAATCATCGG 720

TTTAGAATTT ATGCCGTATA GCGGAGTAGC AGACTTAGCC GCAGCTTGGC GTGTTGCAGA 780

AGCATGTGGT AGAGATAACG CACAACTAAT TTGCGATACT TGGCACTGGG CAAGAGCAAA 840

CCAAACAGCA GAGTCTATCA AAAATGTCCC AGCTGATCGA ATTGTTTCTA TCCAATTGTG 900

TGATGTTCAT GAAACGCCAT ACAAAGAACT TCGTGAAGAA TCGCTTCATG ACCGACTACC 960

ACCTGGTGAA GGATATGGAG ATACAGTTGG TTTTGCACGC ATCCTAAAAG AACATGGTGT 1020

AAATCCACGA GTGATGGGAG TCGAAGTAAT ATCTGACTCT ATGGTTGAAA CTGGTCTAGA 1080

ATATGCTGCA ATTAAAGTAT ATAATGCCAC GAAAAAAGTA TTAGACGAGG CTTGGCCAGA 1140

AATTTCTCCA AAATAATGTA TTCAAACATG TGAAAATAAT AACGTAATCT AGATTCCATC 1200

TTGAAAGGGA AGTGTTTTTT TTGAAATTTA ATTCGATGTT CTCACCTATC GATATTGGCC 1260

CGATGAAAGT ACCTAATCGG TTTGTTGTAT CTCCAATGTG TAAT 1304



(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1328 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vi) ORIGINAL SOURCE:

(B) STRAIN: L IVANOVII 3340 D.F.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:
CCGTCTGTAT TTGCATAGTT ATTACACATT TGCGAAACAA CAAAACGGTT AGGTACTTTC 60

ATTGGTCCGA TATCTATTGG AGAAAACATC GAATTAAATT TCAAAAAAAA CACTTCCCTT 120

TCAAGATAGA ATTTTTCTTT CGTTATTAAT TTCACATAGT ATTGTATCAG ATTATTTTGG 180

TGACACTTCG GGCCATGCTT CGTCTAATAC TTTTTTTGTG GCATTGTATA CTTTAATTGC 240

GGTATATTCT AAGCCAGTTT CTACCATGGA ATCAGATATT ACTTCTACTC CCATGACACG 300

TGGACTTACA CCATGCTCTT TTAAAATACG AGCAAAACCA ATCGTGTCCC CGTATCCTTC 360

ACCAGGAGCT AGTCTATCAT GCAGTGACTC TTCGCGAAGC TCTTTATAGG GCGTTTCATG 420

GACGTCACAG AGTTGAATTG AAACAATCCG ATCAGCAGGT ACATTTTTGA TAGATTCTGC 480

TGTTTGGTTT GCTCTTGCCC AGTGCCATGT GTCACAAATT AGTTGGGCGT TATCTCTGCC 540

GCAAGCCTCT GCCACACGCC ATGCTGCTGC CAAATCTGCT ACTCCGCTGT AAGGCATGAA 600

TTCTAAACCA ATAATCAATT CTTCTGCACG GTCGCAAAGT TCACCAAGAG CAGTAATGAT 660

TTGGTCTTCG GGGATTTTTT CCAATAAACC ACAATTAATA TGTTTTACAC CGAATAAGCG 720

AGCCATGTGG AAGGTAGTTT GTTCTTTCTT TTGTTGCTCG AAAGTGCGGT CAGAAGCGGT 780

TCCCCACTGC GTTATGTATT CTACTTCAGT AACTTTGATG TGATGCTCAT CTAAAATCCT 840

TAACATATCT TCATCAGTCA AGCCAGCTGC CAGTGCATCG ACATAATTTT CGGCGCGTAA 900

ACCAATTCCG TCAAAACCAT TTTCTGCTGC AATTCGTACT CGTTCAGGAA AAGAAACCTC 960

CGTTCCTAAG GTATAAGAGC TAATCGTGAT GGGGCATTTT TTTAGGTTGC CATTTGCATT 1020

TGTCATAAAA ATTATCTCCT CTCTAGATTA AAACACAAGA AACTTTGATA ATGTTTTCAC 1080

AATCACCAGC AAAAAAATAA AATCCATTCA CTTAGAAAAC TTTCTAATGT GAGAACGCAT 1140

TCAATAGTTA GAAAATTGAC TGGACAGTTT TCACATTCTA CCGTATATTG GTTTTTAAAG 1200

GTTAGTTTAT TTCACTGGCA TAACTACTGT TTAATTGTAG TATGACTATT CCATAAAAAC 1260

AAATTGGTAT TATTCTATTA ATCGATAGAT AAATTGCATA GATTATTTTT AACAAGGAGA 1320

GAACCCAT 1328



(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1328 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iv) ANTI-SENSE: YES

(vi) ORIGINAL SOURCE:

(B) STRAIN: L IVANOVII 3340 D.F.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:

ATGGGTTCTC TCCTTGTTAA AAATAATCTA TGCAATTTAT CTATCGATTA ATAGAATAAT 60
ACCAATTTGT TTTTATGGAA TAGTCATACT ACAATTAAAC AGTAGTTATG CCAGTGAAAT 120
AAACTAACCT TTAAAAACCA ATATACGGTA GAATGTGAAA ACTGTCCAGT CAATTTTCTA 180
ACTATTGAAT GCGTTCTCAC ATTAGAAAGT TTTCTAAGTG AATGGATTTT ATTTTTTTGC 240
TGGTGATTGT GAAAACATTA TCAAAGTTTC TTGTGTTTTA ATCTAGAGAG GAGATAATTT 300
TTATGACAAA TGCAAATGGC AACCTAAAAA AATGCCCCAT CACGATTAGC TCTTATACCT 360
TAGGAACGGA GGTTTCTTTT CCTGAACGAG TACGAATTGC AGCAGAAAAT GGTTTTGACG 420
GAATTGGTTT ACGCGCCGAA AATTATGTCG ATGCACTGGC AGCTGGCTTG ACTGATGAAG 480
ATATGTTAAG GATTTTAGAT GAGCATCACA TCAAAGTTAC TGAAGTAGAA TACATAACGC 540
AGTGGGGAAC CGCTTCTGAC CGCACTTTCG AGCAACAAAA GAAAGAACAA ACTACCTTCC 600
ACATGGCTCG CTTATTCGGT GTAAAACATA TTAATTGTGG TTTATTGGAA AAAATCCCCG 660
AAGACCAAAT CATTACTGCT CTTGGTGAAC TTTGCGACCG TGCAGAAGAA TTGATTATTG 720
GTTTAGAATT CATGCCTTAC AGCGGAGTAG CAGATTTGGC AGCAGCATGG CGTGTGGCAG 780
AGGCTTGCGG CAGAGATAAC GCCCAACTAA TTTGTGACAC ATGGCACTGG GCAAGAGCAA 840
ACCAAACAGC AGAATCTATC AAAAATGTAC CTGCTGATCG GATTGTTTCA ATTCAACTCT 900
GTGACGTCCA TGAAACGCCC TATAAAGAGC TTCGCGAAGA GTCACTGCAT GATAGACTAG 960
CTCCTGGTGA AGGATACGGG GACACGATTG GTTTTGCTCG TATTTTAAAA GAGCATGGTG 1020
TAAGTCCACG TGTCATGGGA GTAGAAGTAA TATCTGATTC CATGGTAGAA ACTGGCTTAG 1080
AATATACCGC AATTAAAGTA TACAATGCCA CAAAAAAAGT ATTAGACGAA GCATGGCCCG 1140
AAGTGTCACC AAAATAATCT GATACAATAC TATGTGAAAT TAATAACGAA AGAAAAATTC 1200
TATCTTGAAA GGGAAGTGTT TTTTTTGAAA TTTAATTCGA TGTTTTCTCC AATAGATATC 1260
GGACCAATGA AAGTACCTAA CCGTTTTGTT GTTTCGCAAA TGTGTAATAA CTATGCAAAT 1320
ACAGACGG 1328
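
SEQ ID NO:40 is flagged ANTI-SENSE and has the same length and source strain as SEQ ID NO:39, so it should read as the reverse complement of SEQ ID NO:39. The short Python sketch below is only an illustration of that relationship and is not part of the original listing. It compares the first 20 bases of SEQ ID NO:39 with the last 20 bases of SEQ ID NO:40 as transcribed above; the function name `reverse_complement` is a hypothetical helper.

```python
# Illustrative check only: compares a 20-base window copied from each listing entry.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA string (A, C, G, T)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


seq39_first20 = "CCGTCTGTATTTGCATAGTT"  # bases 1-20 of SEQ ID NO:39
seq40_last20 = "AACTATGCAAATACAGACGG"   # bases 1309-1328 of SEQ ID NO:40

# True when the two windows are reverse complements of each other.
is_antisense = reverse_complement(seq39_first20) == seq40_last20
print(is_antisense)
```

The same helper could be applied to the full 1328-base strings to confirm the relationship over the whole fragment.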



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(vi) ORIGINAL SOURCE:

(B) STRAIN: L MONO 647 ORF

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:

Met Thr Asn Ala Asn Gly Asn Leu Lys Lys Cys Pro Ile Thr Ile Ser
1 5 10 15

Ser Tyr Thr Leu Gly Thr Glu Val Ser Phe Pro Lys Arg Val Lys Val
20 25 30

Ala Ala Glu Asn Gly Phe Asp Gly Ile Gly Leu Arg Ala Glu Asn Tyr
35 40 45

Val Asp Ala Leu Ala Ala Gly Leu Thr Asp Glu Asp Met Leu Arg Ile
50 55 60

Leu Asp Glu His Asn Met Lys Val Thr Glu Val Glu Tyr Ile Thr Gln
65 70 75 80

Trp Gly Thr Ala Glu Asp Arg Thr Ala Glu Gln Gln Lys Lys Glu Gln
85 90 95

Thr Thr Phe His Met Ala Arg Leu Phe Gly Val Lys His Ile Asn Cys
100 105 110

Gly Leu Leu Glu Lys Ile Pro Glu Glu Gln Ile Ile Val Ala Leu Gly
115 120 125

Glu Leu Cys Asp Arg Ala Glu Glu Leu Ile Ile Gly Leu Glu Phe Met
130 135 140

Pro Tyr Ser Gly Val Ala Asp Leu Gln Ala Ala Trp Arg Val Ala Glu
145 150 155 160

Ala Cys Gly Arg Asp Asn Ala Gln Leu Ile Cys Asp Thr Trp His Trp
165 170 175

Ala Arg Ala Asn Gln Thr Ala Glu Ser Ile Lys Asn Val Pro Ala Asp
180 185 190

Arg Ile Val Ser Ile Gln Leu Cys Asp Val His Glu Thr Pro Tyr Lys
195 200 205

Glu Leu Arg Glu Glu Ser Leu His Asp Arg Leu Ala Pro Gly Glu Gly
210 215 220

Tyr Gly Asp Thr Val Gly Phe Ala Lys Ile Leu Lys Glu His Gly Val
225 230 235 240

Asn Pro Arg Val Met Gly Val Glu Val Ile Ser Asp Ser Met Val Ala
245 250 255

Thr Gly Leu Glu Tyr Ala Ala Leu Lys Val Tyr Asn Ala Thr Lys Lys
260 265 270

Val Leu Asp Glu Ala Trp Pro Glu Ile Ser Pro Arg
275 280

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 284 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: L MONO 4450 ORF 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Thr Asn Ala Asn Gly Asp Leu Lys Lys Cys Pro Ile Thr Ile Ser
1 5 10 15

Ser Tyr Thr Leu Gly Thr Glu Val Ser Phe Pro Glu Arg Val Arg Ile
20 25 30

Ala Ala Glu Asn Gly Phe Asp Gly Ile Gly Leu Arg Ala Glu Asn Tyr
35 40 45

Val Asp Ala Leu Ala Ala Gly Leu Thr Asp Glu Asp Met Leu Arg Ile
50 55 60

Leu Asp Glu His Asn Ile Lys Val Thr Glu Val Glu Tyr Ile Thr Gln
65 70 75 80

Trp Gly Thr Ala Glu Asp Arg Thr Ala Glu Gln Gln Lys Lys Glu Gln
85 90 95

Thr Thr Phe His Met Ala Arg Leu Phe Gly Val Lys His Ile Asn Cys
100 105 110

Gly Leu Leu Glu Lys Ile Pro Glu Glu Gln Ile Ile Thr Ala Leu Gly
115 120 125

Glu Leu Cys Asp Arg Ala Glu Glu Leu Ile Ile Gly Leu Glu Phe Met
130 135 140

Pro Tyr Ser Gly Val Ala Asp Leu Ala Ala Ala Trp Arg Val Ala Glu
145 150 155 160

Ala Cys Gly Arg Asp Asn Ala Gln Leu Ile Cys Asp Thr Trp His Ser
165 170 175

Ala Arg Ala Asn Gln Thr Ala Glu Ser Ile Lys Asn Val Pro Ala Asp
180 185 190

Arg Ile Val Ser Ile Gln Leu Cys Asp Val His Glu Thr Pro Tyr Lys
195 200 205

Glu Leu Arg Glu Glu Ser Leu His Asp Arg Leu Ala Pro Gly Glu Gly
210 215 220

Tyr Gly Asp Thr Val Gly Phe Ala Arg Ile Leu Lys Glu His Gly Val
225 230 235 240

Asn Pro Arg Val Met Gly Val Glu Val Ile Ser Asp Ser Met Val Glu
245 250 255

Thr Gly Leu Glu Tyr Ala Ala Ile Lys Val Tyr Asn Ala Thr Lys Lys
260 265 270

Val Leu Asp Gln Ala Trp Pro Glu Ile Ser Pro Lys
275 280

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 284 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(vi) ORIGINAL SOURCE:

(B) STRAIN: L MONO 3340 ORF

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Met Thr Asn Ala Asn Gly Asn Leu Lys Lys Cys Pro Ile Thr Ile Ser
1 5 10 15

Ser Tyr Thr Leu Gly Thr Glu Val Ser Phe Pro Glu Arg Val Arg Ile
20 25 30

Ala Ala Glu Asn Gly Phe Asp Gly Ile Gly Leu Arg Ala Glu Asn Tyr
35 40 45

Val Asp Ala Leu Ala Ala Gly Leu Thr Asp Glu Asp Met Leu Arg Ile
50 55 60

Leu Asp Glu His His Ile Lys Val Thr Glu Val Glu Tyr Ile Thr Gln
65 70 75 80

Trp Gly Thr Ala Ser Asp Arg Thr Phe Glu Gln Gln Lys Lys Glu Gln
85 90 95

Thr Thr Phe His Met Ala Arg Leu Phe Gly Val Lys His Ile Asn Cys
100 105 110

Gly Leu Leu Glu Lys Ile Pro Glu Asp Gln Ile Ile Thr Ala Leu Gly
115 120 125

Glu Leu Cys Asp Arg Ala Glu Glu Leu Ile Ile Gly Leu Glu Phe Met
130 135 140

Pro Tyr Ser Gly Val Ala Asp Leu Ala Ala Ala Trp Arg Val Ala Glu
145 150 155 160

Ala Cys Gly Arg Asp Asn Ala Gln Leu Ile Cys Asp Thr Trp His Trp
165 170 175

Ala Arg Ala Asn Gln Thr Ala Glu Ser Ile Lys Asn Val Pro Ala Asp
180 185 190

Arg Ile Val Ser Ile Gln Leu Cys Asp Val His Glu Thr Pro Tyr Lys
195 200 205

Glu Leu Arg Glu Glu Ser Leu His Asp Arg Leu Ala Pro Gly Glu Gly
210 215 220

Tyr Gly Asp Thr Ile Gly Phe Ala Arg Ile Leu Lys Glu His Gly Val
225 230 235 240

Ser Pro Arg Val Met Gly Val Glu Val Ile Ser Asp Ser Met Val Glu
245 250 255

Thr Gly Leu Glu Tyr Thr Ala Ile Lys Val Tyr Asn Ala Thr Lys Lys
260 265 270

Val Leu Asp Glu Ala Trp Pro Glu Val Ser Pro Lys
275 280

(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 284 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(vi) ORIGINAL SOURCE:

(B) STRAIN: L MONO 3327 ORF

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

Met Thr Asn Val Asn Gly Asp Leu Lys Lys Cys Pro Ile Thr Ile Ser
1 5 10 15

Ser Tyr Thr Leu Gly Thr Glu Val Ser Phe Pro Glu Arg Val Arg Ile
20 25 30

Ala Ala Glu Asn Gly Phe Asp Gly Ile Gly Leu Arg Ala Glu Asn Tyr
35 40 45

Val Asp Ala Leu Ala Ala Gly Leu Thr Asp Glu Asp Met Leu Arg Ile
50 55 60

Leu Asp Glu His His Ile Lys Val Thr Glu Val Glu Tyr Ile Thr Gln
65 70 75 80

Trp Gly Thr Ala Ser Asp Arg Thr Lys Glu Gln Gln Lys Lys Glu Gln
85 90 95

Thr Thr Phe His Met Ala Arg Leu Phe Gly Val Lys His Ile Asn Cys
100 105 110

Gly Leu Leu Glu Lys Ile Pro Glu Glu Gln Ile Ile Thr Ala Leu Gly
115 120 125

Glu Leu Cys Asp Arg Ala Glu Glu Leu Ile Ile Gly Leu Glu Phe Met
130 135 140

Pro Tyr Ser Gly Val Ala Asp Leu Ala Ala Ala Trp Arg Val Ala Glu
145 150 155 160

Ala Cys Gly Arg Asp Asn Ala Gln Leu Ile Cys Asp Thr Trp His Trp
165 170 175

Ala Arg Ala Asn Gln Thr Ala Glu Ser Ile Lys Asn Ile Pro Ala Asp
180 185 190

Arg Ile Val Ser Ile Gln Leu Cys Asp Val His Glu Thr Pro Tyr Lys
195 200 205

Glu Leu Arg Glu Glu Ser Leu His Asp Arg Leu Ala Pro Gly Glu Gly
210 215 220

Tyr Gly Asp Thr Val Gly Phe Ala Arg Ile Leu Lys Glu His Gly Val
225 230 235 240

Asn Pro Arg Val Met Gly Val Glu Val Ile Ser Asp Ser Met Val Glu
245 250 255

Thr Gly Leu Glu Tyr Ala Ala Ile Lys Val Tyr Asn Ala Thr Lys Lys
260 265 270

Val Leu Asp Glu Ala Trp Pro Glu Val Ser Pro Lys
275 280

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 284 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(vi) ORIGINAL SOURCE:

(B) STRAIN: L MONO 3359 ORF

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:



Met Thr Asn Ser Asn Gly Asn Leu Lys Lys Cys Pro Ile Thr Ile Ser
1 5 10 15

Ser Tyr Thr Leu Gly Thr Glu Val Ser Phe Pro Glu Arg Val Lys Ile
20 25 30

Ala Ala Glu Asn Gly Phe Asp Gly Ile Gly Leu Arg Ala Glu Asn Tyr
35 40 45

Val Asp Ala Leu Ala Ala Gly Leu Thr Asp Asp Asp Met Leu Gln Ile
50 55 60

Leu Asp Lys His Asn Ile Lys Val Thr Glu Val Glu Tyr Ile Thr Gln
65 70 75 80

Trp Gly Thr Glu Ala Asp Arg Thr Asp Ala Gln Gln Gln Lys Glu Gln
85 90 95

Thr Thr Phe His Met Ala Arg Leu Phe Gly Val Lys His Ile Asn Cys
100 105 110

Gly Leu Leu Glu Lys Ile Pro Glu Glu Gln Ile Ile Thr Ala Leu Gly
115 120 125

Glu Leu Cys Asp Arg Ala Glu Glu Leu Ile Ile Gly Leu Glu Phe Met
130 135 140

Pro Tyr Ser Gly Val Ala Asp Leu Ala Ala Ala Trp Arg Val Ala Glu
145 150 155 160

Ala Cys Gly Arg Asp Asn Ala Gln Leu Ile Cys Asp Thr Trp His Trp
165 170 175

Ala Arg Ala Asn Gln Thr Ala Glu Ser Ile Lys Asn Val Pro Ala Asp
180 185 190

Arg Ile Val Ser Ile Gln Leu Cys Asp Val His Glu Thr Pro Tyr Lys
195 200 205

Glu Leu Arg Glu Glu Ser Leu His Asp Arg Leu Pro Pro Gly Glu Gly
210 215 220

Tyr Gly Asp Thr Val Gly Phe Ala Arg Ile Leu Lys Glu His Gly Val
225 230 235 240

Asn Pro Arg Val Met Gly Val Glu Val Ile Ser Asp Ser Met Val Glu
245 250 255

Thr Gly Leu Glu Tyr Ala Ala Ile Lys Val Tyr Asn Ala Thr Lys Lys
260 265 270

Val Leu Asp Glu Ala Trp Pro Glu Ile Ser Pro Lys
275 280

(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

GCGATACGAC AAATCTGTTA GGCACC 

(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

GAGATACAAC AAAACGATTT GGTACT 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

GCGAAACCAC GAAACGGTTT GGCACA 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

GAGATACAAC AAACCGATTA GGTACT 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

GCGAAACAAC AAAACGGTTA GGTACT 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CATTCCTTTC ACAGGGAGTC TTCCTAC 27

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
CTTCCCTTTC AAGGTAGAAT CTTCTTGT 28 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

CTTTCCTTTC AAAGGGGGAT TTTTA 25

(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

CTTCCCTTTC AAGATGGAAT CTAGAT 26

(2) INFORMATION FOR SEQ ID NO: 55:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CTTCCCTTTC AAGATAGAAT TTTTTCTT 28

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
TAGTTGGATG GAAACAATCC GATCAG 26

(2) INFORMATION FOR SEQ ID NO: 57:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

TAGTTGGATA GAAACAATCC GATCAG 26

(2) INFORMATION FOR SEQ ID NO: 58:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:
TAATTGGATG GAAACAATCC GGTCAG 26



(2) INFORMATION FOR SEQ ID NO: 59: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

AAGTTGAATA GACACAATTC GGTCAG 26

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:

CAATTGGATA GAAACAATTC GATCAG 26

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:
GAGTTGAATT GAAACAATCC GATCAG 26

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
TTTGTTGTTC TGCTGTACGA TCTTCGG 27 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
TTTGCTGTTC GGCTGTGCGG TCCTCGG 27

(2) INFORMATION FOR SEQ ID NO: 64:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
TTTGTTGTTC TTTGGTGCGG TCGGAAG 27

(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
GTTGTTGAGC ATCGGTTCGG TCTGCTT 27 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

TTTGTTGCTC GAAAGTGCGG TCAGAAG 27

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
AAATCCCTGA GGAACAAATC ATCGTC 26

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:
AAATCCCTGA GGAACAAATC ATTGTC 26

(2) INFORMATION FOR SEQ ID NO: 69: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

AAATCCCCGA AGAACAAATC ATTACG 26

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
AGATTCCGGA AGAACAAATC ATTACT 26

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:
AAATTCCAGA AGAGCAAATA ATTACT 26

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:
AAATCCCCGA AGACCAAATC ATTACT 26

(2) INFORMATION FOR SEQ ID NO: 73:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:
GGTAGAATAG GTTAACTGTC CAGTTCC 27

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:
GGTAGAATAG GTTAACTGTC CAGTTGA 27

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
GGTAGAATTC GAAAACTGTC CAGTCAG 27 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:
GGTAGAATAC GTTAACTGTC CAGTTGT 27

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

GGTAGAATGT GAAAACTGTC CAGTCAA 27

(2) INFORMATION FOR SEQ ID NO: 78:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
TACAATTAGA CTGTAGTTAC GCCAGTGA 28

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
TACAATTAAA CTGTAGTTAC GCCAGTGA 28

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:
TACAATTAGA CTGTAGTTAC ACCAGCAG 28

(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
TACAATTAAA CTGTAATTAA GCCAGTGA 28

(2) INFORMATION FOR SEQ ID NO: 82:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

TACAATTAAA CTGTAGTTAC ACCAGCAG 28

(2) INFORMATION FOR SEQ ID NO: 83:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
TACAATTAAA CAGTAGTTAT GCCAGTGA 28

(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:
TTTGATAGAT TCTGCTGTTT GGTTTGCTCT 30

(2) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:
TTTGATAGAT TCAGCTGTTT GATTTGCTCT 30
(2) INFORMATION FOR SEQ ID NO: 86:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:
TTTGATTGAT TCTGCTGTTT GGTTTGCTCT 30

(2) INFORMATION FOR SEQ ID NO: 87: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:
TTTGATAGAT TCTGCTGTTT GATTTGCTCT 30

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
TTTGATAGAC TCTGCTGTTT GGTTTGCTCT 30

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

TGCTGTTTGG TTTGCTCTAG CCCAGTGCCA 30 

(2) INFORMATION FOR SEQ ID NO: 90:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

AGCTGTTTGA TTTGCTCTAG CCCAGTGCCA 30

(2) INFORMATION FOR SEQ ID NO: 91:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:
TGCTGTTTGG TTTGCTCTAG CCGAGTGCCA 30

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:
TGCTGTTTGA TTTGCTCTAG CCCAGTGCCA 30

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
TGCTGTTTGG TTTGCTCTTG CCCAGTGCCA 30 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:
TTGCATTTGT CATAAAAATT ATCTCCTCTC 30

(2) INFORMATION FOR SEQ ID NO: 95:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:
TTACATTTGT CATAAAAATT ATCTCCTCTC 30

(2) INFORMATION FOR SEQ ID NO: 96: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:
TTGAATTTGT CATAAAAATT ATCTCCTCTC 30

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
CGCTGCGGAA AACGGTTTTG ACGGATTTGG 30 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
CGCTGCGGAA AAAGGTTTTG ACGGATTTGG 30 
(2) INFORMATION FOR SEQ ID NO: 99:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
CGCAGCAGAA AACGGTTTTG ATGGATTTGG 30

(2) INFORMATION FOR SEQ ID NO: 100:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:

TGCAGCAGAA AACGGTTTTG ACGGATTTGG 30

(2) INFORMATION FOR SEQ ID NO: 101:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:

AAAAAAATGC CCCATCACGA TTAGCTCTTA 30

(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:
AAAAAAATGC CCAATCACGA TCAGCTCTTA 30

(2) INFORMATION FOR SEQ ID NO: 103: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
AAAAAAATGC CCCATCACGA TTAGTTCTTA 30 
(2) INFORMATION FOR SEQ ID NO: 104: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104:
AAATGCAAAT GGCAACCTAA AAAAATGCCC 30

(2) INFORMATION FOR SEQ ID NO: 105:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:

AAATGCAAAT GGCAATCTAA AAAAATGCCC 30

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

AAATGCAAAT GGCGACCTAA AAAAATGCCC 30

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
AAATGTAAAT GGCGACTTAA AAAAATGCCC 30 
(2) INFORMATION FOR SEQ ID NO: 108:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
AAATTCAAAT GGCAACTTAA AAAAATGCCC 30

(2) INFORMATION FOR SEQ ID NO: 109:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:
CAATACCAAT TTGTTTTTAT GGAATAGTCA 30
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:

TAATACCAAT TTGTTTTTAT GGAATAGTCA 30

CLAIMS

What is claimed is: 

1. A method for determining whether an unknown bacterium is a
Listeria monocytogenes, comprising

(A) amplifying genomic DNA from (i) a positive test panel of
Listeria monocytogenes strains and (ii) a negative test panel
of non-monocytogenes Listeria strains with a primer derived
from a pre-marker diagnostic fragment for Listeria
monocytogenes selected from the group of nucleic acids
corresponding to SEQ ID NOS:17, 18, and 19 to yield a 1300
bp diagnostic fragment for each of the positive and negative
test panels;

(B) selecting at least one Listeria monocytogenes diagnostic
marker contained within the diagnostic fragment by
comparing the diagnostic fragment obtained from the
amplification of the positive test panel with the diagnostic
fragment obtained from the amplification of the negative test
panel whereby at least one highly conserved region in the
diagnostic fragment of the positive test panel is identified
which is less than 90% homologous to any member of the
negative test panel;

(C) designing at least one amplification primer corresponding to 
the at least one diagnostic marker identified in 
step (B); and 

(D) amplifying genomic DNA of the unknown bacterium under
suitable annealing temperatures with the at least one
amplification primer of step (C), whereby obtaining at least
one amplification product indicates that the unknown
bacterium is a Listeria monocytogenes. 
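For illustration only, the comparison described in step (B) of Claim 1 might be sketched as a sliding-window scan. This is a minimal sketch under stated assumptions, not the procedure actually used: it assumes the diagnostic fragments have already been aligned to equal length, it treats "highly conserved" as identical across the whole positive panel, and the function names, window width and toy sequences are all hypothetical.

```python
def identity(a: str, b: str) -> float:
    """Fraction of identical positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def candidate_windows(positive, negative, width=20, max_neg_identity=0.90):
    """Return 0-based window starts that are identical across the positive
    panel and below max_neg_identity against every negative-panel member."""
    ref = positive[0]
    hits = []
    for start in range(len(ref) - width + 1):
        window = ref[start:start + width]
        # Assumption: "highly conserved" means identical in every positive strain.
        conserved = all(p[start:start + width] == window for p in positive)
        # The window must be less than 90% identical to each negative strain.
        distinct = all(
            identity(window, n[start:start + width]) < max_neg_identity
            for n in negative
        )
        if conserved and distinct:
            hits.append(start)
    return hits


# Invented toy data (NOT from the sequence listing), pre-aligned, 20 bases each.
TOY_POSITIVE = ["ACGTACGTACGGATTACAGG", "ACGTACGTACGGATTACAGG"]
TOY_NEGATIVE = ["ACGTACGTACCCTAAACAGG"]

print(candidate_windows(TOY_POSITIVE, TOY_NEGATIVE, width=10))
```

Each returned start marks a window from which a candidate primer of step (C) could be drawn. Real fragments of about 1300 bp would first need a proper multiple alignment.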
2. The method of Claim 1 wherein the Listeria monocytogenes pre-marker
diagnostic fragment is selected from the group consisting of nucleic acids
corresponding to SEQ ID NOS:20-23.

3. The method of Claim 1 wherein the diagnostic fragment is at least
83% homologous to SEQ ID NOS:24-31 and 33-40.

4. The method of Claim 1 wherein the diagnostic fragment is selected
from the group consisting of nucleic acids corresponding to SEQ ID NOS:24-31
and 33-40.

5. The method of Claim 1 wherein at least one diagnostic marker
selected in step (B) is selected from the group consisting of nucleic acids
corresponding to SEQ ID NOS:46-83.

6. The method of Claim 1 wherein the at least one amplification primer
is about 15 to 30 bp in length.

7. The method of Claim 1 wherein the suitable annealing temperature is
in the range of about 60 °C to 70 °C.

8. A method for determining whether an unknown bacterium is a 
member of the genus Listeria, comprising 

(A) amplifying genomic DNA from (i) a positive test panel of
Listeria monocytogenes strains and (ii) a negative test panel
of non-monocytogenes Listeria strains with a primer derived
from a pre-marker diagnostic fragment for Listeria
monocytogenes strains selected from the group consisting of
nucleic acids corresponding to SEQ ID NOS:17, 18, and 19
to yield a 1300 bp diagnostic fragment for each of the positive
and negative test panels;

(B) selecting at least one Listeria genus-specific diagnostic
marker contained within the diagnostic fragment by
comparing the diagnostic fragment obtained from the
amplification of the positive test panel with the diagnostic
fragment obtained from the amplification of the negative test
panel whereby at least one highly conserved region in the
diagnostic fragment of the positive test panel is identified
which is at least 90% homologous to the corresponding
positive test panel of diagnostic fragment;

(C) designing amplification primers corresponding to the at least 
one Listeria genus-specific diagnostic marker selected in step 
(B); and 

(D) amplifying genomic DNA of the unknown bacterium under
suitable annealing temperatures with the amplification primers
of step (C), whereby obtaining amplification products 
indicates that the unknown bacterium is a member of the 
genus Listeria. 

9. The method of Claim 8 wherein at step (A) the diagnostic fragment is
at least 83% homologous to any one of SEQ ID NOS:24-31 and 33-40.

10. The method of Claim 8 wherein at step (A) the diagnostic fragment is
selected from the group consisting of nucleic acids corresponding to SEQ ID
NOS:24-31 and 33-40.

11. The method of Claim 8 wherein the Listeria monocytogenes pre-marker
diagnostic fragment is selected from the group consisting of nucleic acids
corresponding to SEQ ID NOS:20-23.

12. The method of Claim 8 wherein the at least one diagnostic marker 
selected in step (B) is selected from the group consisting of nucleic acids 
corresponding to SEQ ID NOS:84-110. 
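As an illustration only, four of the 30-base sequences within the range named in Claim 12 (SEQ ID NOS:105-108, copied from the sequence listing) can be compared position by position to see where they vary. The sketch below is not part of the claimed method, and the helper names `clean` and `mismatch_positions` are hypothetical.

```python
# Sequences as printed in the sequence listing (blocks of ten bases).
SEQS = {
    105: "AAATGCAAAT GGCAATCTAA AAAAATGCCC",
    106: "AAATGCAAAT GGCGACCTAA AAAAATGCCC",
    107: "AAATGTAAAT GGCGACTTAA AAAAATGCCC",
    108: "AAATTCAAAT GGCAACTTAA AAAAATGCCC",
}


def clean(seq: str) -> str:
    """Remove the spacing used in the listing and normalise case."""
    return seq.replace(" ", "").upper()


def mismatch_positions(a: str, b: str) -> list:
    """Return the 1-based positions at which two equal-length sequences differ."""
    a, b = clean(a), clean(b)
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [i + 1 for i, (x, y) in enumerate(zip(a, b)) if x != y]


for seq_id in (106, 107, 108):
    print(seq_id, mismatch_positions(SEQS[105], SEQS[seq_id]))
```

The variable positions fall in the middle of the 30-mers, while the first four and last eleven bases are shared by all four sequences.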

13. The method of Claim 8 wherein the at least one amplification primer
is about 15 to 30 bp in length.

14. The method of Claim 8 wherein the suitable annealing temperature is 
in the range of about 60 °C to 70 °C. 

15. A method for determining whether an unknown bacterium is a
Listeria monocytogenes, comprising contacting the genomic DNA of the unknown
bacterium with a nucleic acid probe selected from the group consisting of nucleic 
acids corresponding to SEQ ID NOS:46-83, and then detecting hybridization of 
the nucleic acid probe with the genomic DNA. 

16. A method for determining whether an unknown bacterium is a
Listeria monocytogenes comprising contacting the genomic DNA of the unknown
bacterium with a nucleic acid probe selected from the group consisting of nucleic
acids corresponding to SEQ ID NOS:84-110, and then detecting hybridization of
the nucleic acid probe with the genomic DNA. 
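The probe-based detection of Claims 15 and 16 can be mimicked, very roughly, by string matching. The sketch below is a simplification under stated assumptions: real hybridization depends on stringency conditions that are not modelled here, the function names are hypothetical, and the toy genome is invented. Only the probe sequence (SEQ ID NO:109) is taken from the sequence listing.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def find_hits(genome: str, probe: str, max_mismatches: int = 0) -> list:
    """Scan one genome strand for the probe and for its reverse complement.

    Returns (0-based start, strand label, mismatch count) tuples. A "+" hit
    means the given strand carries the probe sequence itself; a "-" hit means
    it carries the reverse complement of the probe.
    """
    hits = []
    for strand, target in (("+", probe), ("-", reverse_complement(probe))):
        for i in range(len(genome) - len(target) + 1):
            window = genome[i:i + len(target)]
            mismatches = sum(a != b for a, b in zip(window, target))
            if mismatches <= max_mismatches:
                hits.append((i, strand, mismatches))
    return hits


# SEQ ID NO:109 as printed in the sequence listing, spacing removed.
PROBE_109 = "CAATACCAATTTGTTTTTATGGAATAGTCA"

# Invented toy "genome" (hypothetical) that embeds the probe sequence.
TOY_GENOME = "GGGG" + PROBE_109 + "TTTT"

print(find_hits(TOY_GENOME, PROBE_109))
```

Raising `max_mismatches` loosely imitates lower-stringency conditions, although it is no substitute for a thermodynamic model.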

17. Isolated nucleic acid fragments selected from the group consisting of
nucleic acid fragments corresponding to SEQ ID NOS:17 through 110.

18. An isolated nucleic acid fragment encoding the amino acid sequence 
as given in any one of SEQ ID NOS:32 and 41-45. 

19. A nucleic acid fragment located on a diagnostic fragment of about 
1300 bp and selected from the group consisting of nucleic acid fragments 

designated

1515(rc341x2)-26-363,
1515(rc341x2)-27-281,
1515-26-36,
1515-27-357,
1515-26-rc233,
1515(8585)-27-rc737,
1515(8585)-28-rc793,
1515-30-76,


1515-30-88, 
1515(8585)-30-624, 
1515(8585)-30-rc483,
1515(8585)-30-rc555,
1515(8585)-30-rc573,
1515(8585)-30-rc824,
the diagnostic fragment characterized by 

(A) at least 83% homology to any one of SEQ ID NOS:24-31 and
33-40; and

(B) an open reading frame of about 855 bp contained within the
diagnostic fragment, the open reading frame encoding an
amino acid sequence of any one of SEQ ID NOS:32 and 41-45.